Attensity, a provider of software applications that are designed to seamlessly integrate and analyze both structured and unstructured information, has announced Attensity Workstation, a multi-language desktop addition to its family of unstructured data transformation platforms. Attensity Workstation extracts and transforms facts captured in free-form text into structured data for integration with the tabular data of virtually any existing analytical software package.
Attensity Workstation is the first Attensity product to incorporate the company's new Directed Learning technology, and offers a multi-language capability with technology supporting virtually any single-byte language. In addition, Attensity Workstation offers a new example-based model that is intended to enable any user to master the skills for configuration with as little as 30 minutes of training, for on the fly analysis in response to time-critical requests. After instruction on Attensity Workstation, users can begin to "teach" the system how to find and extract the types of events needed from a corpus of documents. The user need only be conversant in the language of the document set and will then respond to examples derived by Attensity Workstation from the document set and displayed in its training interface. Based on the feedback received from the user, Attensity Workstation derives grammar and extraction rules, and is ready to be run over the data set to output structured data in a form that is loaded into the user's preferred repository, application, or business intelligence solution.
Attensity Workstation's Directed Learning technology is language-independent, capable of supporting text extraction in any language, with broad single-byte language support. Attensity Workstation supports most European (single-byte) and Middle Eastern (single-byte, bi-directional) languages. Asian languages (double-byte languages, such as Chinese, Japanese, and Korean), will be available in the second half of 2005.