In recent years, statistical machine translation (SMT) has become the leading paradigm in machine translation. SMT systems are built by analyzing large volumes of parallel corpora and learning translation models from this data, and the quality of an SMT system depends largely on the size of its training data. Since the majority of parallel data exists for major languages, SMT systems for those languages are of much better quality than systems for smaller languages. This quality gap is deepened further by the complex linguistic structure of many smaller languages: languages such as Latvian, Lithuanian and Croatian (to name just a few) have rich morphology and free word order, and learning this complexity from corpus data requires much larger volumes of training material. Current systems are built on data accessible on the web, yet that is only a fraction of all parallel texts; most still reside in the local systems of corporations, public and private institutions, and on the desktops of individual users. The cost and know-how required to build custom MT solutions deter many small-to-medium companies from utilizing the power of MT technologies.
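To illustrate what "learning translation models from parallel data" means in practice, the following is a toy sketch of IBM Model 1 word-alignment training, a standard building block underlying phrase-based SMT. The three-sentence corpus and the language pair are purely illustrative, not data from any actual system described here:

```python
from collections import defaultdict

# Toy parallel corpus (illustrative pairs only; real SMT systems
# are trained on millions of aligned sentence pairs).
corpus = [
    ("the house", "das haus"),
    ("the book", "das buch"),
    ("a book", "ein buch"),
]
pairs = [(e.split(), f.split()) for e, f in corpus]

# Initialize translation probabilities t(e|f) uniformly over the
# target-side vocabulary.
e_vocab = {e for es, _ in pairs for e in es}
t = defaultdict(lambda: 1.0 / len(e_vocab))

for _ in range(10):  # EM iterations
    count = defaultdict(float)  # expected co-occurrence counts
    total = defaultdict(float)  # normalizers per source word
    for es, fs in pairs:
        for e in es:
            # E-step: distribute each target word's probability mass
            # over the source words it may align to.
            norm = sum(t[(e, f)] for f in fs)
            for f in fs:
                c = t[(e, f)] / norm
                count[(e, f)] += c
                total[f] += c
    # M-step: re-estimate t(e|f) from the expected counts.
    for (e, f) in count:
        t[(e, f)] = count[(e, f)] / total[f]

# After a few iterations, "haus" aligns most strongly with "house".
best = max(e_vocab, key=lambda e: t[(e, "haus")])
```

Even on this tiny corpus the EM estimates separate true translation pairs from mere co-occurrences; with morphologically rich languages, each surface form appears far less often, which is why much more data is needed to reach comparable estimates.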
A short description of the task performed by the Croatian partner