Home > Internet > NYTimes : New Search Technologies Mine the Web More Deeply

NYTimes : New Search Technologies Mine the Web More Deeply

February 23rd, 2009

One day last summer, Google’s search engine trundled quietly past a milestone. It added the one trillionth address to the list of Web pages it knows about. But as impossibly big as that number may seem, it represents only a fraction of the entire Web.

Beyond those trillion pages lies an even vaster Web of hidden data: financial information, shopping catalogs, flight schedules, medical research and all kinds of other material stored in databases that remain largely invisible to search engines.

The challenges that the major search engines face in penetrating this so-called Deep Web go a long way toward explaining why they still can’t provide satisfying answers to questions like “What’s the best fare from New York to London next Thursday?” The answers are readily available — if only the search engines knew how to find them.

Now a new breed of technologies is taking shape that will extend the reach of search engines into the Web’s hidden corners. When that happens, it will do more than just improve the quality of search results — it may ultimately reshape the way many companies do business online.

New Search Technologies Mine the Web More Deeply - NYTimes.com

Blogged with the Flock Browser

Internet

  1. No comments yet.
  1. July 1st, 2010 at 16:15 | #1