Browsing Data Science Institute (Conference Papers) by Subject "Uniform Resource Identifiers"
Four Heuristics to Guide Structured Content Crawling
(2008) Search engines focusing on particular media types face difficulties in discovering suitable URIs on the Web. Since the engines are only interested in a small fraction of the Web, a crawler should use heuristics to concentrate ...