Scopul nostru este sprijinirea şi promovarea cercetării ştiinţifice şi facilitarea comunicării între cercetătorii români din întreaga lume.
Domenii publicaţii > Ştiinţe informatice + Tipuri publicaţii > Capitol de carte
Autori: Dan Tufiş
Editorial: Ishida, T., Fussell, S.R., Vossen, P, Springer-Verlag, Berlin Heidelberg, Intercultural Collaboration I, LNCS 4568, p.103-117, 2007.
Rezumat:
Parallel corpora encode extremely valuable linguistic knowledge, the revealing of which is facilitated by the recent advances in multilingual corpus linguistics. The linguistic decisions made by the human translators in order to faithfully convey the meaning of the source text can be traced and used as evidence on linguistic facts which, in a monolingual context, might be unavailable to (or overlooked by) a computer program. Multilingual technologies, which to a large extent are language independent, provide a powerful support for systematic and consistent cross-lingual studies and allow for easier building of annotated linguistic resources for languages where such resources are scarce or missing. In this paper we will briefly present some underlying multilingual technologies and methodologies we developed for exploiting parallel corpora and we will discuss their relevance for cross-linguistic studies and applications.
Cuvinte cheie: aliniere, adnotare, colocatii, cercetari cros-linguale, codicicare, dezambiguizare (POS, WSD), corpusuri paralele, tehnologii multilinguale, tagging, wordneturi // alignment - annotations - collocations - cross-language studies - disambiguation (POS and WSD) - encoding - parallel corpora - multilingual technologies - tagging - wordnets
URL: http://www.springerlink.com/content/1l8370w645720vxk/?p=17a3dc817b0a467cb9b958d4cec4aee3&pi=7