Search
Now showing items 1-2 of 2
Learning content patterns from linked data
(CEUR-WS.org, 2014)
Linked Data (LD) datasets (e.g., DBpedia, Freebase) are used in many knowledge extraction tasks due to the high variety of domains they cover. Unfortunately, many of these datasets do not provide a description for their ...
µRaptor: A DOM-based system with appetite for hCard elements
(CEUR-WS.org, 2014)
This paper describes µRaptor, a DOM-based method to extract hCard microformats from HTML pages stripped of microformat markup. µRaptor extracts DOM sub-trees, converts them into rules, and uses them to extract hCard ...