One of the next frontiers of search is taking all of the unstructured data spread helter-skelter across the Web and treat it like it is sitting in a nice, structured database. It is easier to get answers out of a database where everything is neatly labeled, stamped, and categorized. As the sheer volume of stuff on the Web keeps growing, keyword search keeps getting closer to its breaking point.
Adding structure to the Web is one way to make sense of all that data, and Google is starting the tackle the problem with a Google Labs project called Google Squared, which Marissa Mayer mentioned earlier today at the companyâ€™s Searchology briefing.
Google Squared extracts data from Web pages and presents them in search results as squares in an online spreadsheet.This type of technology has obvious applications for many types of targeted searches, including product search, health search, scientific searches, you name it. There are dozens of semantic search startups trying to impose structure on the Web to perform similar tricks. Another high-profile search startup which is launching on Monday,Â Wolfram Alpha, takes a slightly different approach in that it simply ingests massive amounts of information into its own databases where it can query it to its heartâ€™s delight. Already there is a bit of aÂ rivalry between Google and WolframÂ because getting back structured results is a major new direction for search.
Wolfram does a pretty good job parsing the information in its own databases, but those databases will never match what is available on the Web. Wolframâ€™s databases currently store only 10 terabytes of information, a tiny fraction of what is on the Web. (I will be posting my impressions of Wolframâ€™s search engine soon). Google Squared is an early attempt to take the messy data which exists on the Web and place it into simple tables. It is still very experimental and isnâ€™t always on target, but you can see where this is going. Turning the Web into a giant database will crush any attempt to segregate the â€œbestâ€ information into a separate database so that it can be processed and searched more deeply.
In the video demo below, a search for â€œcameraâ€ sorts the results in different columns by images, description, and manufacturer, resolution, etc.. You can refine results by clicking on a particular column such as manufacturer. A search for â€œrollercoastersâ€ sorts results by name, image, description, height, length, and number of inversions. But sometimes it gets confused. A search for â€œspaceshipsâ€ turns up a Corvette and a missile carrier. It is going to be a while before this makes it out of Google Labs