Google's attempt to automatically turn unstructured data into structured content in a spreadsheet-like format is now live in Google Labs. It's called Google Squared. You can access it here.
Unlike Wolfram|Alpha, where data is curated by W|A staff, the data accessible via Google Squared is not curated but rather automatically culled and organized from various "open web" sites. The source for a piece of data can be found by highlighting its box. We think Google should make the source easier to find; we only stumbled upon it. Researchers need to know where the data is coming from.
Here's an excellent Google Squared page for Roller Coasters. You can also easily add more columns of data to the square.
Where does the data come from?
A variety of sources. One we noticed coming up again and again was Wikipedia. Makes sense since Wikipedia has some structure to it.
Wikipedia is OK if you understand Wikipedia's strengths and weaknesses. Not all users do. Another issue we wondered about is whether the snippets found in the various squares are updated in real time, or on a regular basis, when changes are made to the Wikipedia page. In other words, is Google using a live version of the Wikipedia database?
Here are a few more "Squares" that we think illustrate why Google Squared is still a part of Google Labs. In other words, more work needs to be done.
+ A Google Squared search for Chicago. Note how some data is missing and how it lists the Trump International Hotel and Tower as being in Ft. Lauderdale. When you click on the link that appears when you hover your cursor over the box, you go to a page about a Trump building in New York City. Also, take note of the first set of boxes for the Adler Planetarium. The description and location boxes are not very accurate.
+ A Google Squared search for "Google" does a nice job listing some Google services, but beyond that most of the other options are blank or make little sense. For example, the employment box for Google Health reads, "I'm Afraid I Can't Let You Do That Dave." (-: Btw, a search for Twitter is also a bit off the mark.
+ A Google Squared search for hamburgers. You might expect the squares to be filled with names like McDonald's, In-N-Out, Fatburger, Wendy's, etc. Nope. Just a few hamburger-related boxes containing only a tad of information.
Postscript: Several people have asked if you can export a square to a CSV or XLS file. From what we can see and have read elsewhere, the answer is no, not at this time.