Receive the weekly sampler of posts and "Resource of the Week".
Subscribe »

Enter your
email address:

My Account »


Bookmark and Share

Testimonial?
If you find ResourceShelf useful, please supply a testimonial »








Home > ResourceBlog > Article

« All ResourceBlog Articles

 

Bookmark and Share   Feed

Monday, 7th January 2008

Wikia Search Arrives Online: A Look at Wikia Search and Wikia the "Hub" Site

UPDATE: We need to be a bit clearer. We are looking at both Wikia Search and Wikia.com, two separate sites with two separate search tools.

It's always exciting when a new search engine comes online and today is one of those days with the release of the Wikia web search engine.

Opening Overview
It's really hard to say that much since we haven't had much time to use it. Wikia did not offer ResourceShelf preview access but the bottom line is ONLY TIME WILL TELL the power Wikia gains as a search tool which of course will be based at least in part on the quality of results (and related features) it offers users.

Of course, searching the web is not only about delivering results. That's easy. There is a lot of content out there but also how are the results:
+ Relevance vs. search terms (not easy to do given many factors including people reverse engineering the system*** and trying to manipulate).
+ Organization of results
+ Scope of results
+ Privacy and Security Issues (Wikia discusses these issues here)
+ "Value added" features, Advanced search features
++++ Related materials in a single location (Google Universe, Yahoo OneSearch, Ask 3D, etc.)
+ Reputation(s) of source(s)
+ Other traditional database and web search metrics

Again, today is only day one (it's essential to stress this point) for Wikia Web Search so it's really difficult to judge what the service will be like in 2 weeks, 3 months, 6 months and/or a year from today.

It's also important to remember that no one search engine is perfect or ideal for every search. At ResourceShelf, we're about using the right engine or database at the right time and often using more than just to double or triple check what you find.

Wikimedia juggernaut co-founder*, Jimmy Wales, was on the $$$ when he told NPR in 2006 to take what you find in Wikipedia with a, "grain of salt". This is the case with many search tools (including Wikia, other databases, and even books) and why using more than one resource is a good idea. Think journalist.

When we have written about Wikipedia and other volunteer efforts we have asked will:

+ How social? Will people stay involved or will they move on to the next big thing? True, Wikia is a commercial venture but from we can see TODAY, a great deal of the original content comes from volunteers like Wikipedia offers.

+ Will outside sources*** try to manipulate content and how results appear? If so, will the wisdom of crowds know best? This is an issue for all search engines.***

+ Will people know about and use this product?

* The other co-founder of Wikipedia, Larry Sanger, left Wikipedia and now leads another organization and online knowledge tool named Citizendium.

*** This is not a good or bad thing but the way it works. So, get used to it. That said, a little bit of training and education can go a long way (minutes and hours NOT days and months and semesters) into making a better searcher.

While info pros look at web search as a research tool (like Dialog, LexisNexis, Factiva, and others), many look at it as another "form" of research and better said, marketing. They want their results at the top. Period. Most people don't look beyond result number 5 or 6 and for sure not past the first ten.

Before offering a brief look at Wikia Web Search as of today, here are a few related posts.

More Reading
+ More on Wikia and Grub; Lucene Will Be Used as Search Engine

+ LookSmart Will Power Wikia Advertising

+ History of the Project


So, Where's the Search?

There is NO search box on the Wikia home page as of today, just a link to Wikia Search. That page is found here.

It's an early release. Note the word alpha in the URL. Kudos to Wikia for being upfront about this. To learn more you'll need to create and account and learn more. To search, just enter your search terms.

To work on editing, commenting, and being a part of the community, you'll need to register. It's free.

Results. Fast Facts (aka what we're seeing as of 9am EST).

Registered users can discuss each and every results page.

+ Here's a search for "Newspapers."

+ Some results will have Mini Articles. We ran 20 searches (common terms) and did not find any as of today.

+ Pages are cached locally.

+ Results are not numbered.

+ Next to each result is the Nutch (what powers Wikia Search) score that links to a page for the specific result and for the search term. For the most part, this will be of interest to only the search geek or info scientist. Of course, SEO's might also find it very useful.

+ We were unable to quickly find an adult "safesearch filter" and did find adult content by just entering search terms. Of course, third party filters will block some of this content but remember, like web search NOTHING is perfect especially filters. No filter will get it all be it from a web engine or third party.

+ No paid listings/sponsored links on results pages as of today.

+ there is no spell check. Example #2. This is a good example of why it's an alpha release.

+ A search for Star Trek
+++ A mini result (no result for Star Trek is listed)
+++ Results with cache and Nutch scores along with the ability to vote (1-4 stars for each result, not working at the moment).
+++ Ability to use search than one index (Four indices are available testing)
+++ Community: People who have expressed an interest in Star Trek
+++ Community: Registered users have many options (privacy settings) dealing with who sees or doesn't see what you write or images you post.
+++ 10 results per page. To see more results just click. Each set of results is kept on the page. So clicking a few times you create a single page with 30 or 40 results.

What kind of searches does Nutch support? (quoted, nested, truncation, wildcarding [and where], Boolean)

"...." (phrase search?), + (what is this for?), - (negation) and fieldname:term. No "AND" or "OR". The and-logic is implied.

More on Nutch itself on this page.

We are not going to comment on the quality of the results today. Let's give them 4-6 weeks and see how they look then compared to what we see today. We will be taking some screen caps today.

Some other searches we tried. Feel free to review the results for yourself. We will take another look at these and other searches in the coming weeks.
+ "Time Magazine"
+ war in iraq
+ science fair projects
+ Britney Spears
+ Flickr mashup mac
+ Capital of Washington state
+ Hepatitis C

Since we're writing on Wikia itself today, let's take a look at what resources these sites provide.
For Wikia Search (what's getting all of the attention today, that's posted above).

Please, Let's Not Forget
Numerous EXCELLENT non-commercial directories and databases of information (both general and subject specific) are out there being built and maintained by info pros and scholars. They are unfortunately often overlooked. We don't want to overlook them today. A few examples include:

+ Librarians' Internet Index
+ Intute
+ Internet Public Library
+ Infomine
+ HomeworkNYC
+ GlobalEdge (International Business)
+ ELIDS (Global Development)
+ TechExtra (Info Techonology)



The Wikia Site and Databases (not Wikia Search)
It's important to remember that Wikia itself has been around for some time (approx. 2004) as a place where people could build and host topical wiki's. You'll see them organized on a new homepage by category. More about Wikia the wiki production and hosting site here.

Top-Level (aka Wiki Hubs) Categories include:
+ Cartoons & Comics
+ Education
+ Fan Fiction
+ Finance
+ Gaming
+ Health
+ Imagination
+ Politics & Activism
+ Science & Nature
+ Science Fiction
+ Sports
+ Technology

Under each category are wiki's on specific topics. For example, Finance includes:
++ Entrepreneur
++ Investment
++ Shopping
Of course, just like Wikipedia these pages can be edited by registered members (free). After spending a bit of time browsing the site, Wikia seems to be especially strong in pop-culture topics. Again, don't forget to review what you can find in some of the non-commercial directories listed above.

Back to Wikia.com
So, let's start from the homepage and select Health.

Now, look in the upper right hand corner for a search box with the terms "Search Wikia" in it.

There you go. Enter your terms and your're underway.

Now, let's look at a result page where you will find four sets of results.

Here's a search for George Washington.
Note: Quotation marks do not appear to work with two (This WIKI and ALL WIKIA) of the four tabs.

The results are labeled:

+ This Wiki (if you are searching a specific wiki or hub
+ All Wikia
+ Wikipedia (uses Google Custom Search)***
+ The Web (results from all over the web, using Google Custom Search)

*** Avi Rappaport pointed out this page from Google noting issues with Google Custom Search using more than 3 domains and how the results might not be complete.

Now, with the major caveats, that:
+ This is day one
+ Running a few test searches is really not the best way to test an engine
+ Results may vary
+ Since the quality of the results will vary from person to person, you be the judge.

Let's review what you see when you search Wikia for a few items. Let's start with:

"Steve Ballmer"

This Wiki
No results. If we were searching for an info tech or wiki about computing we might find some results.

All Wikia
Result #1 is in Spanish, Result #2 is in German, and Result #3 is from a satirical wiki.
Overall, it was a challenge to find even a brief English language bio.

Wikipedia via Google Custom Search
Result #1, a Wikipedia page about Ballmer (what you would expect in this spot)
Result #2, a Wikipedia entry titled "Dance Monkeyboy"
Result #3, a Wikipedia entry titled "Team OS/2"

Web Results via Google Custom Search
Result #1, Same Wikipedia page that's number one in the Wikipedia tab
Result #2, a YouTube video titled, Steve Ballmer Going Crazy
Result #3, His official MSFT Bio

Next
Now, let's try a search for Dinosaurs.
Gone is the tabbed results page. Here users see only links to specific subject wiki's.
Other examples would be Star Trek and Video Games.

Pakistan
Again, the 4 tabbed view since their is no specific Wikia for this topic. We also noticed that a search for "Pakistan Politics" a phrase returns zero items in the This Wiki and All Wikia tabs. The same is true for other phrases and names.

Example: NASCAR triggers a "hub only" page when searching from the upper-right hand page "Search Wikia" box.

+ George Washington
+ "Science Fair Projects"
+ "European Commission"
You do get results vis the GCSE under The Web and Wikipedia tabs and users are often invited to create a Wikia hub if one is not available.

We were also unable to find any spelling correction or suggestions in some cases. Note the incorrect spelling of Star Trekk vs.
Star Trek or Chicago vs. Chiccago. Again, this is day one so perhaps this it's coming soon?

Review and Questions

+ We were surprised not to spot a search box on the home page.

+ The way the web results look today is likely not the way they will appear over time. Of course, all engines are constantly tweaking and tuning their algorithm plus always growing their databases. So, this is nothing new. What is new is that the masses will be able to participate in one way or another.

Wales' approach would open that [algortithm details] process. Initially, participants will help make such decisions as whether a site on "Paris Hilton" refers to the celebrity or a French hotel.

+ As of today, the Wikipedia and The Web Search Tab produce results powered by Google Custom Searches. Again, as we pointed out earlier***, Google documents "issues" with using more than 3 URL's in a custom search. Perhaps, the actual search engine Wikia uses is still in development?

+ Will people use it (the web search + the Wikia search) in the first place and also participate in making the results better? Are the masses looking for more to do or just want good results. Period.

+ Will Wikia be able to live up to its goals (over some period of time)?

+ Will knowing how the engine operates (what has been called the "secret sauce" by Chris Sherman) algorithmically make it even easier to reverse engineer/manipulate or perhaps a better word, "game?" Danny Sullivan talks about the issue here.


Category:

Views: 566




blog comments powered by Disqus

« All ResourceBlog Articles

 

Read about the FreePint FamilyFreePint Family

A family of resources to help information workers be more effective, raise the value of information in their organisations and contribute to success. Read more »


FeedLatest Family Articles:


Click to view the article Quilting big data threads
Thursday, 24th May 2012

Recently I have found myself cooing over visualisation maps (and heat maps) of health and well being resources. The content rich data is overlayed with mapping technologies, and some interesting themes and patterns are emerging.


Click to view the article The fallacy of information overload
Wednesday, 23rd May 2012

A lot of the talk around social media in the last year has been around information overload. Social media has provided us with new and exciting ways to create content. But it has also meant learning new ways to manage and engage with social media tools. Are we teetering on the edge of an information overload precipice?


Click to view the article Information overload: fact, fantasy or filter failure?
Wednesday, 23rd May 2012

Information overload is a figment of your imagination. Or a failure of your filter. Or a symptom of your technological submissiveness. Depends on who you ask.


Click to view the article Newsdesk: tracking millions of pieces of information a day
Tuesday, 22nd May 2012

What if you had to sort through 3.5 million articles and social media posts a day and try to pull out the most relevant items for your organisation? What if you then had to cobble it all together into something readable for your top groups and executives in your organisation?


Click to view the article Alacra Compliance adds managerial oversight
Tuesday, 22nd May 2012

Alacra Compliance saves time by aggregating information from both free and fee-based sources and enabling users to conduct an accurate federated search across these sources (coined “simultaneous search” by Alacra).


All Family Articles »
Family Articles by Category »


Tell us what you're working on,
and we'll talk to you about how FreePint can help »


FreePint Family Testimonials

"Fabulous resource to learn of unique tools and insights. Very useful." Manager, Futures and Forecasting, Virginia, USA

More testimonials »






Subscribe

Subscribe to the ResourceShelf Newsletter and receive the weekly sampler of posts and Resource of the Week.

Find out more »

ResourceShelf sponsored by:

Article Categories

All Article Categories »

Archive

All Archives »