UK Web Focus

Reflections on the Web and Web 2.0

Archive for the 'search' Category


Should We “Leave Search To Google?”

Posted by Brian Kelly (UK Web Focus) on 21 April 2008

When I chaired the session on Search at the Museums and the Web 2008 conference the discussion, as I described in a recent post, turned to lightweight approaches to federated searching. During the session I received a Twitter comment on my feedback channel (intermingled with the football scores!) asking “is it more useful to develop compelling browse interfaces & leave search to Google?” The response at the time seemed to be that although Google might have a role to play in the future, its role at present is limited (in a museums’ context) due to the complexities of typical collections management Web interfaces: the valuable data is part of the ‘deep Web’ which search engines such as Google find difficult to index.

But just a few day’s ago, via a comment made by Nate Solas on his blog post about the Search session, I discovered that Google have announced their intention to index the deep Web:

This experiment is part of Google’s broader effort to increase its coverage of the web. In fact, HTML forms have long been thought to be the gateway to large volumes of data beyond the normal scope of search engines. The terms Deep Web, Hidden Web, or Invisible Web have been used collectively to refer to such content that has so far been invisible to search engine users. By crawling using HTML forms (and abiding by robots.txt), we are able to lead search engine users to documents that would otherwise not be easily found in search engines, and provide webmasters and users alike with a better and more comprehensive search experience.

Mia Ridge has commented on the implications of this announcement:

You’re probably already well indexed if you have a browsable interface that leads to every single one of your collection records and images and whatever; but if you’ve got any content that was hidden behind a search form (and I know we have some in older sites), this could give it much greater visibility.

In light of Google’s announcement it is timely, I would think, to revisit the question “It is it more useful to develop compelling browse interfaces & leave search to Google?” Imagine the quality of services we could provide if we redirect resources from replicating search algorithms which have already been developed (”standing on the shoulders of giants”).

And let’s remember (a) the evidence which suggests that users prefer simple search interfaces and (b) the costs of attempting to compete with Google in the search area - let’s not forget that, despite their riches, Microsoft haven’t been able to compete successfully. Is it likely that search technologies developed by tax-payers’ money will succeed where Microsoft have failed?

PS I should probably add that I’m not the first to suggest this idea. The OpenDOAR team, in particular have deployed a search interface using Google across institutional repository services. Many congratulations to the team at the University of Nottingham for evaluating this lightweight approach.

Posted in search | 26 Comments »

A New Search Interface for HERO

Posted by Brian Kelly (UK Web Focus) on 26 September 2007

I have been reading the September issue of the HERO Headlines magazine, which provides “the latest news from HERO Ltd, the company behind the UK’s official online gateway to higher education and research opportunities“.

An article in the magazine describes the release of a search tool which can be added to Internet Explorer and Firefox browser to enable the HERO.ac.uk Web site to be searched directly from the browser, without first having to go to the HERO Web site. Use of this search facility to search for articles about UKOLN is shown in the diagram.

Search for 'UKOLN' on Hero Web site

At one stage there was a tendency in various Web development circles that browser-specific enhancements should be avoided, as they don’t necessarily provide universal solutions (in this case, users of the Opera browser may feel disenfranchised). I don’t go along with this argument - I feel that this provides a richer and easier-to-use solution for many users, whilst still allowing users of more specialist browsers (or old versions of Internet Explorer or Firefox) to search the Web site in the traditional way.

Congratulations to HERO for this development. Now how many institutions are configuring their browsers with similar search interfaces for their institutional Web site, I wonder?

Posted in browser, search | 6 Comments »