Workshop on the Generation of Multilingual Parallel Documents

3 April 2017
Venue: Room 00/A075; Ariane Building; 400, route d'Esch; L-1443 Luxembourg
Organiser: Directorate-General for Translation of the European Commission
http://dragoman.org/pardoc - hashtag: #pardoc
NEW: Slides - MED (template, including programs)

About

This event is a technical workshop on how to automate the production of multilingual parallel documents (pardocs) based on the simultaneous generation of all the linguistic versions from one template and parallel segments.

To avoid misunderstanding: generation of pardocs is not about machine translation (MT) or computer aided translation (CAT). It is about processing already translated parallel segments into documents. MT and CAT might be used in creating parallel segments.

Content of the workshop

The workshop will encompass: base concepts, historical example, manual example, available spartan system, future development of one open source system supporting many document types, and perhaps short talks by authors of position papers. The workshop will conclude with an open discussion.

Multilingual parallel documents (pardocs)

Pardocs are a common output in organisations working in several languages such as European Union organisations. An example of pardocs are Commission Regulations, in this case in 23 languages. The generation of pardocs is well adapted to typified document such as Types of documents in EUR-Lex.

Position papers

Position papers must be related to the subject at hand; they will be taken into account to shape the workshop. Anybody can submit position papers, even non-participants; PDF, HTML and plain text are the preferred formats. Selected authors may be invited to do short presentations. Authors should consider practical document type(s) relevant to their work. Send by the 20 March 2017 to:

  manuel.carrasco-benitez@ec.europa.eu
  Subject: pardoc - paper

Authors submiting position papers and not participating in the workshop must not register.

Profile of participants

The workshop has been designed to be accessible to a wide audience: it is expected that most participants will not have the ideal background in computer science and multilingual document processing. Participants must be interested in the industrial production of pardocs and they should have some experience in at least one of the aspects of Authoring, Translation and Publication (ATP). Participants must be aware of the technical IT nature of the workshop.

After the workshop, participants should have a good understanding on this technique for the generation of pardocs. At the very least, participants should gain ideas applicable to similar situations.

Registration

The workshop is open to all and attendance is free. For registration send an email to:

  manuel.carrasco-benitez@ec.europa.eu
  Subject: pardoc - registration
  Body:
    person_title:
    name:
    affiliation:
    email:
    telephone:+
    position_paper:
    access_pass:

Set person_title to Mr., Ms., etc. Set position_paper to yes if one will be sent. Set access_pass to yes if one is required to access European Commission buildings; an access pass will be provided on the day at the reception on the presentation of a national identify card or passport; participants working for the European Institutions/bodies can use their card. An email will be sent confirming the acceptance/rejection.

To cancel the registration send an email asap from the same address as the one used for the registration to:

  manuel.carrasco-benitez@ec.europa.eu
  Subject: pardoc - cancel
  Body:
    cancel the registration

It is up to the participants to monitor the workshop website for changes, though some communications might also be sent via email.

Schedule

10:30 - 11:25 : Session 1
11:25 - 11:35 : Short break
11:35 - 12:30 : Session 2
12:30 - 14:00 : Lunch break - participants pay for their lunch, available at the canteen
14:00 - 14:55 : Session 3
14:55 - 15:05 : Short break
15:05 - 16:00 : Session 4

Sessions      : 55 minutes
Short breaks  : 10 minutes
Lunch break   : 90 minutes

There will be a continuous flow; hence, sessions are not pre-dedicated to a subject. The language of the workshop is English.