Now showing items 1071-1071 of 1071

    • µRaptor: A DOM-based system with appetite for hCard elements 

      Muñoz, Emir; Costabello, Luca; Vandenbussche, Pierre-Yves (CEUR-WS.org, 2014)
      This paper describes µRaptor, a DOM-based method to extract hCard microformats from HTML pages stripped of microformat markup. µRaptor extracts DOM sub-trees, converts them into rules, and uses them to extract hCard ...