Index Engines Aims to Optimize Content Management with Collection Engine

May 10, 2011

With a demonstration at EMC World, held May 9-12 in Las Vegas, Index Engines released a new platform, the Index Engines Collection Engine, which augments search procedures to accelerate backup processes and the extraction of files in a corporate environment. Collection Engine makes full use of Data Domain, a backup platform from EMC, to sift through archived data, extract content, and support litigation compliance. Unlike Index Engines' integrations with other platforms, the pairing with Data Domain opens new opportunities for search and backup solutions.

"Prior to this new product, Index Engines extracted records from backup images and passed them to third party platforms such as records management and compliance archives," says Jim McGann, the vice president of Information Discovery, Index Engines. "With this new product, Index Engines extracts records and manages them in a collection. For Data Domain the collection resides on the same server, so Data Domain users can search, find and extract relevant records onto to the same platform."

While the ability to search and index by content is not a foreign concept, the junction of the two platforms allows for more accurate search options. "The Collection Engine works with the Data Domain platform to directly index the data contained in the backup image. The Index Engines platform connects to the Data Domain storage system, indexes the content, and then through the Index Engines GUI, users are able to search the data within the Data Domain backup image," says McGann.

Unique to this product is its ability to gather search results into a managed collection. "The Index Engines user interface also allows users to save queries and execute them on a regular basis to locate targeted data based on corporate retention policies. The data of interest can then be extracted to a collection area within the Data Domain storage system for future access. Without this capability, users cannot identify or access specific information within the Data Domain storage system," says McGann.
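The saved-query idea McGann describes can be pictured with a small Python sketch. It is only an illustration under stated assumptions: the `Item` and `SavedQuery` types, the keyword match, and the `retain_days` window are all hypothetical and do not reflect the actual Index Engines interface or how it expresses retention policies.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Item:
    """A file or email found in a backup image (hypothetical shape)."""
    path: str
    text: str
    created: date


@dataclass(frozen=True)
class SavedQuery:
    """A stored search that can be re-run on a schedule (hypothetical shape)."""
    name: str
    keyword: str
    retain_days: int  # assumed retention window, in days


def run_saved_query(query, items, today):
    """Return paths of items that match the keyword and fall inside the retention window."""
    return [
        item.path
        for item in items
        if query.keyword.lower() in item.text.lower()
        and (today - item.created).days <= query.retain_days
    ]


items = [
    Item("finance/q1.xls", "Invoice for Q1", date(2011, 1, 15)),
    Item("finance/old.xls", "Old invoice", date(2009, 3, 1)),
    Item("misc/party.txt", "Holiday party", date(2011, 4, 1)),
]
invoices = SavedQuery(name="recent invoices", keyword="invoice", retain_days=365)

# Re-running the same saved query on a later date would pick up newly backed-up items.
to_collect = run_saved_query(invoices, items, today=date(2011, 5, 10))
print(to_collect)
```

In this sketch the matching paths are what would then be extracted to the collection area; the scheduling itself is left out.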

By pairing search with backup, the product brings order to archived data through a usable search interface. "A lot of people are trying to manage their records that get buried in user files and emails and there is a lot of stuff that is not that interesting to keep," says McGann.

In this process, the Collection Engine scans backup media to generate a catalog and index, which the query application then uses to find files relevant to a search. Once queries have identified the emails and files of interest, that content is copied to the archive. It is the Collection Engine that allows the Data Domain platform to be used in such an efficient way. As data volumes grow in the corporate realm, the Collection Engine targets specific content through search queries. Tasks such as archiving and managing the security and compliance of records call for this kind of detailed collection process.
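The three steps in that process (scan and index, query, copy to an archive) can be sketched in a few lines of Python. This is a minimal illustration, not the Collection Engine's actual design: the `Record` type, the in-memory word index, and the function names are all assumptions made for the example, and real backup images would first have to be parsed out of proprietary formats.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    """One file or email found inside a backup image (hypothetical shape)."""
    path: str
    text: str


def build_index(records):
    """Scan every record once and map each lowercase word to the paths that contain it."""
    index = defaultdict(set)
    for record in records:
        for word in record.text.lower().split():
            index[word].add(record.path)
    return index


def search(index, query):
    """Return the paths whose text contains every word in the query."""
    words = query.lower().split()
    if not words:
        return set()
    hits = set(index.get(words[0], set()))
    for word in words[1:]:
        hits &= index.get(word, set())
    return hits


def collect(records, paths):
    """Copy the matching records into a separate collection, keyed by path."""
    return {r.path: r for r in records if r.path in paths}


backup_image = [
    Record("mail/0001.eml", "Contract renewal terms for vendor"),
    Record("docs/lunch.txt", "Lunch menu for Friday"),
    Record("docs/contract.doc", "Signed vendor contract"),
]

index = build_index(backup_image)
matches = search(index, "vendor contract")
collection = collect(backup_image, matches)
print(sorted(collection))
```

The point of indexing first is that each later query only touches the index, not the backup media itself.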

"We can create a repository that will hold it and manage that content," says McGann. "It's a pretty simple idea. You have to find the proverbial needle in the haystack and the haystack is a complicated place in a corporate environment because there are lots of proprietary formats. We dig deep into that haystack and all sorts of environments and allow you to create a policy that binds data."

The ability to sift through and manage a high volume of data makes that data accessible to more users and widens the opportunities for search among clients. "It's expandable in the sense that you can just add users and it will go and collect their data. So it can support hundreds or thousands of users within a corporate environment," says McGann. "You usually have to search around networks and find all sorts of other locations for this data. Backup is the sort of thing that every company does. To be able to tap into that and extract from it is really the unique value we provide."