This article originally appeared in the Nov-Dec 1993 issue of Language Industry Monitor.

Tools, rules, and lots of text: the RELATOR project will make more of these vital resources available to industry and academia.

While modestly funded (ECU 500,000), the LRE RELATOR project is a timely new undertaking to set up a basic repository for written and spoken linguistic data, rules, and tools in Europe. As developers are well aware, building robust NLP and speech applications with wide coverage requires enormous amounts of raw data. The Linguistic Data Consortium (LDC) at the University of Pennsylvania has been addressing this need by gradually making such materials available for English and a growing number of other languages, but it still makes sense for a similar kind of organization to be established in Europe. A European counterpart to the LDC would in principle be better placed to respond to the needs of European developers; it would also be the logical channel for the distribution of European materials.