Basis Technology Powers Mark Logic Entity Enrichment

October 7, 2008

Basis Technology Corp. (www.BasisTech.com) announced today that Mark Logic has licensed its Rosette Entity Extractor (REX) for thirteen languages to enrich the XML of a document by locating concepts that cannot easily be found through simple keyword matching. With REX built into MarkLogic Server version 4.0, Mark Logic customers can enrich content automatically and develop powerful applications that rely on more sophisticated structured queries and analytics.

MarkLogic Server is the industry’s leading XML server, and is based on a modern architecture designed specifically for processing XML. This flexible, open platform enables rapid creation of more interactive, richer applications, making MarkLogic the best place to natively store, manage, search and dynamically deliver information. MarkLogic helps customers react quickly to changing market conditions by identifying new revenue opportunities, building fresh and innovative products, and streamlining the delivery of information.

The Rosette Entity Extractor identifies many different types of entities in unstructured data, including person, organization, location, credit card number, email address, latitude/longitude, date, and time, among others. It locates generic terms as well as specific references, and uses advanced linguistics to determine sentence boundaries and the part of speech of each word. These features make it possible to identify and extract important information from documents, adding greater structure to the information, which can then be further analyzed by MarkLogic Server.

REX currently extracts entities in Arabic, Chinese, Dutch, English, Farsi, French, German, Italian, Japanese, Korean, Russian, Spanish, and Urdu, with more languages under development.

“By integrating the Rosette Entity Extractor into MarkLogic Server, we’ve added advanced text mining capability to support our customers’ requirements for fine-grained search and analytics,” said Jason Monberg, vice president of product management for Mark Logic Corporation. “With this release we are introducing several significant enhancements to our XML server and are pleased to have worked with Basis Technology to provide the entity enrichment feature for our customers.”

REX is designed for integration into software systems for information retrieval, text mining, relationship extraction, business intelligence, military intelligence, e-commerce and other applications that classify, analyze, and mine textual information.

About Mark Logic Corporation

Mark Logic Corporation is the provider of the industry’s leading XML server. The company’s flagship product, MarkLogic Server, includes a unique set of capabilities to store, manage, search and dynamically deliver content. The company has two patents on its innovative technology, and is privately held with Sequoia Capital as its lead investor. To read the Mark Logic CEO Blog, visit marklogic.blogspot.com. To learn more about Mark Logic, or to download a free community or trial edition of MarkLogic Server, go to www.marklogic.com.

About Basis Technology

Basis Technology provides software solutions for text analytics, information retrieval, and name resolution in many languages. The company’s Rosette(R) Linguistics Platform is a widely adopted suite of interoperable components that delivers high-performance results to search, business intelligence, e-discovery, and many other enterprise applications. Basis Technology is at the forefront of applied natural language processing solutions, using a combination of statistical modeling, expert rules and corpus-derived data.

Leading software vendors, content providers, financial institutions, and government agencies rely on Basis Technology’s solutions for Unicode compliance, language identification, multilingual search, normalization, name matching, name translation, and entity extraction. Our products and services are used by over 250 major firms, including Cisco, EMC, Endeca, HP, Microsoft, Oracle, and Symantec. Our text analysis products are widely used in the U.S. defense and intelligence industry by such firms as CACI, Lockheed Martin, MITRE, Northrop Grumman, SAIC, and SRI. We are also the top provider of multilingual search technology to web search engines, such as AOL, Ask.com, Google, Windows Live, and Yahoo!

Company headquarters are in Cambridge, MA, with branch offices in San Francisco, California; Herndon, Virginia; and Tokyo, Japan. For more information, visit www.BasisTech.com or call 800-697-2062.



