Now showing items 1-1 of 1

  • Four Heuristics to Guide Structured Content Crawling 

    Umbrich, Jürgen; Harth, Andreas; Hogan, Aidan; Decker, Stefan (2008)
    Search engines focusing on particular media types face difficulties in discovering suitable URIs on the Web. Since the engines are only interested in a small fraction of the Web, a crawler should use heuristics to concentrate ...