IBM
has already got an enterprise-class search product in
OmniFind
, so why is it partnering with Yahoo to provide a free version of the
same? The answer is simple: IBM wants to attack Google’s Mini search hardware in
the small business marketplace.
But don’t think of
IBM
OmniFind Yahoo Edition as just a toy version of the real product,
or a loss-leader to grab attention and direct users to an expensive upgrade
path. IBM OmniFind Yahoo is a serious tool. It might be free, but it’s one lunch
that looks likely to satisfy, courtesy of the capacity to index half a million
documents and an ability to combine results with Yahoo-powered web searches.
The “three-click installation” promise is no hype. It really is a doddle to
install – just don’t expect it to be a quick doddle. Although installation
itself took 15 minutes via the Java GUI, the entire afternoon was spent in
building a 3.5GB index of the 50,000 documents found on the local drives and
network drives mounted on IWR’s test search server.
Be aware that user access policies will be ignored, so the administrator must
take care not to index restricted data nor restrict access to the search itself
(and its cache).
IBM OmniFind Yahoo
failed to index some documents because of a lack of support for the
file format, including such common types as JPG, MP3 and TIF. Some 200 file
types are searchable (in 30 languages), including DOC, HTML, PDF, XLS, XML and
ZIP.
Look out for the maximum size limit if you deal with larger documents –
anything bigger than 51.2MB and OmniFind Yahoo Edition barfs. The other major
size restriction comes with the total number of documents in the index, but
given that this has been set at 500,000 that really shouldn’t matter.
EXTENDABILITY
Because it is built on an Apache Lucene framework and has an open URL-based
Representational State Transfer (REST) API, it is easy enough for the
technically minded to deliver results as Atom feeds or embedded HTML snippets
within a web page. The whole system has been developed from the ground up with
custom search-based application integration in mind, making it easy to create
extensions to leverage the core engine with third-party applications.
Accuracy and speed are proof of the search value pudding. Surprisingly,
really surprisingly, IBM OmniFind Yahoo satisfies on both counts. Not only did
it return relevant and plentiful results, it was also lightning quick in
bringing them up. Compared to off-the-shelf SQL query-based software, this is
light years ahead.
While the default interface is the familiar one, it is easy enough to rebrand
by removing the Yahoo links, logos and colour schemes and replacing them with
your own. It is just as easy to add your own synonyms to the search engine,
either importing them or adding them manually, as well as creating your own
sponsored links in Google style so that certain keywords always return an
associated URL at the top of the search results.
The lack of multiple index support is disappointing, as you can have only a
single collection of documents rather than using a single engine to index
multiple sites as separate entities. But then IBM OmniFind Yahoo is clearly
intended to be an entry-level product, rather than a rival to the mid-tier
players. Nonetheless, Google
must be quaking in its boots at the prospect of a totally free solution
that outperforms its own rather expensive
Mini
entry-level product.
With all this ease of use and flexible familiarity it is easy to forget that
IBM OmniFind Yahoo is a serious enterprise-level tool. You will need to deploy
this optimised enterprise search server on server-class hardware, and use all
available resources to speed up the crawl rate while delivering quick responses
on multiple simultaneous queries from a myriad users.
The ease of installation, administration and use means there is no need to
spend a fortune on support “wine” to accompany this free lunch. It is there if
you want to imbibe (£1,369 excluding VAT per year per server), but seeing as the
free online support forums are so good there appears to be little point.
What does take the edge off the appetite a little is that IBM OmniFind Yahoo
is only really “free” if you happen to have the relevant server hardware with
spare capacity available. Unlike the Google Mini, it is not a hardware-and-all
plug-and-play solution. Factor in the cost of a server with a couple of fast
processors, enough RAM and mirrored drive space, and the cost comparison becomes
rather more complex.
Do you agree?
Have your say on this article