This article originally appeared in the July-Aug 1994 issue of Language Industry Monitor.

One of the world’s premier industrial NLP labs is launching its technology on the market. With the unveiling of Xerox Lexical Technology (XLT), Xerox is the most recent newcomer to the increasingly competitive OEM market for linguistic software. XLT is a toolbox which offers morphological reduction and generation, morphological derivation, part-of-speech disambiguation, and tokenization functions for a number of languages. XLT will be licensed in packs of three languages, with English-French-German available now, and other European languages, including Spanish, Italian, and Portuguese, available shortly. Eventually, Xerox plans to extend support to include Japanese, Korean, and Chinese as well.

XLT was developed at Xerox’s renowned PARC laboratories in Palo Alto, California, where researchers Martin Kay, Ron Kaplan, Lauri Karttunen, and others have virtually dictated the direction of computational linguistics over the past fifteen years. Based on the insight that many lexical processes, such as morphological analysis and generation, could be described in terms of finite-state mechanisms, Xerox PARC’s researchers have spent the past decade building an array of so-called finite-state transducers: bidirectional programs (they handle both analysis and generation) which represent all possible morphological mutations of a word in terms of mathematical relations. Xerox says these linguistic modules are clean, compact, efficient, and uniform, and that the code which runs them is very small and language-independent.

Morphological processing is one of the cornerstones of language processing, and it is likely to be found in virtually all mainstream software within a few years. Despite getting a late start, will Xerox nonetheless be able to move quickly and aggressively enough to capture a piece of this action?
While few will dispute that Xerox’s technology is fundamentally superior, Xerox must also offer commensurate breadth, both in terms of lexical coverage and the number of languages it supports, and this means a great deal more of the not-so-exciting work of lexicon coding. Xerox will also be faced with convincing potential takers that the Xerox technology has more than just a first-class theoretical pedigree; the company is entering an environment where ad hoc solutions dominate. The prevailing attitude towards linguistics among software engineers who have had to deal with it can best be summed up as: “less is more.”

Xerox Corporation, Advanced Office Document Services, 3400 Hillview Ave, Palo Alto, CA 94304, USA; Tel: +1 415 813 6804, Fax: +1 415 813 6792