Moses-support Digest, Vol 127, Issue 16

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."


Today's Topics:

1. Re: TRAINING_extract-phrases ERROR: malformed XML (Ergun Bicici)
2. EAMT 2017 Call for Participation (Mikel L. Forcada)
3. TC39 - Second call for proposals (Evans, Richard J.)


----------------------------------------------------------------------

Message: 1
Date: Thu, 11 May 2017 22:12:27 +0300
From: Ergun Bicici <bicici@gmail.com>
Subject: Re: [Moses-support] TRAINING_extract-phrases ERROR: malformed
XML
To: moses-support <moses-support@mit.edu>
Message-ID:
<CAB59qTNtyNcWc8eKqnrScE1xe7nr_WQ66vR2ZSG7N93QAMbagw@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

clean-corpus-n.perl can clean XML tags before tokenization:

sub word_count {
my ($line) = @_;
if ($ignore_xml) {
$line =~ s/<\S[^>]*\S>/ /g;
$line =~ s/\s+/ /g;
$line =~ s/^ //g;
$line =~ s/ $//g;
}
my @w = split(/ /,$line);
return scalar @w;
}

Ergun

On Thu, May 11, 2017 at 10:33 AM, Ergun Bicici <bicici@gmail.com> wrote:

>
> Similarly:
> ERROR: some opened tags were never closed: it shares some features in
> common with the SGML < ! [ CDATA [ ] ] > construct , in that it declares a
> block of text which is not for parsing .
>
>
> On Thu, May 11, 2017 at 10:32 AM, Ergun Bicici <bicici@gmail.com> wrote:
>
>>
>> TRAINING_extract-phrases is giving
>> ERROR: malformed XML: Wirtschaftsjahr Betriebsgr?sse < 50.000 kg 120.000
>> kg
>> ERROR: malformed XML: < ! -- / * Font Definitions *
>>
>> etc.
>>
>> this appears to be due to the tokenization of html tags.
>>
>> Is there an option of Moses to handle these?
>>
>> --
>>
>> Regards,
>> Ergun
>>
>> Ergun Bi?ici
>> http://bicici.github.com/ <http://ergunbicici.blogspot.com/>
>>
>
>
>
> --
>
> Regards,
> Ergun
>
> Ergun Bi?ici
> http://bicici.github.com/ <http://ergunbicici.blogspot.com/>
>



--

Regards,
Ergun

Ergun Bi?ici
http://bicici.github.com/ <http://ergunbicici.blogspot.com/>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20170511/b93e27f3/attachment-0001.html

------------------------------

Message: 2
Date: Fri, 12 May 2017 09:09:50 +0100
From: "Mikel L. Forcada" <mlf@dlsi.ua.es>
Subject: [Moses-support] EAMT 2017 Call for Participation
To: moses-support@mit.edu
Message-ID: <2237ff37-1925-7fdf-c195-629cb1d3b603@dlsi.ua.es>
Content-Type: text/plain; charset=utf-8; format=flowed

Call for participation

20th Annual Conference of the European Association for Machine
Translation (EAMT 2017; Prague, Czech Republic)

The European Association for Machine Translation
(EAMT,http://www.eamt.org) invites everyone interested in machine
translation, translation-related tools and resources to attend this
conference. The 20th Annual Conference of the European Association for
Machine Translation, will be held in Prague, Czech Republic from 29 to
31 May 2017, at the Faculty of Mathematics and Physics, Charles
University, Malostransk? n?m?st? 25.

This year's programme includes oral and poster presentations of both
user papers and research papers, a poster session about project and
product descriptions, an invited talk by Jo?o Gra?a (Unbabel), and an
exciting social programme.

The programme is available at https://ufal.mff.cuni.cz/eamt2017/program.php.

For more information, please visit the conference web page at
https://ufal.mff.cuni.cz/eamt2017/.

To register, visit https://ufal.mff.cuni.cz/eamt2017/reg.php .

We look forward to seeing you in Prague!

Conference organisers

General Chair: Mikel L. Forcada (Universitat d'Alacant, Spain)

Track Chairs: Alexander Fraser, research programme chair (
Ludwig-Maximilians-Universit?t, Munich, Germany), Kim Harris, user
programme chair (text&form, Berlin, Germany)

Local Organisation Chair: Ond?ej Bojar (Charles University, Prague,
Czech Republic)

Conference Sponsors

Gold sponsor: MEMSOURCE Translation and Localization Solutions

Silver sponsor: Star Group

Bronze sponsor: text&form

Supporting sponsors: Charles University, Apertium, Prompsit Language
Engineering

Media sponsor: Multilingual magazine

Co-located event:
Social MT 2017: First workshop on Social Media and User Generated
Content Machine Translation (https://sites.google.com/view/socialmt/),
31 May 2017.


--
Mikel L. Forcada http://www.dlsi.ua.es/~mlf/
Departament de Llenguatges i Sistemes Inform?tics
Universitat d'Alacant
E-03690 Sant Vicent del Raspeig
Spain
Office: +34 96 590 9776



------------------------------

Message: 3
Date: Fri, 12 May 2017 09:25:08 +0000
From: "Evans, Richard J." <R.J.Evans@wlv.ac.uk>
Subject: [Moses-support] TC39 - Second call for proposals
To: "moses-support@mit.edu" <moses-support@mit.edu>
Message-ID:
<B8C5913BF277A34B9C638833F0B065EC011F647EC2@Exchmbx10I02.unv.wlv.ac.uk>

Content-Type: text/plain; charset="windows-1252"


39th Translating and the Computer Conference (TC39), 16-17 November 2017, London

2ND CALL FOR PROPOSALS

The International Association for Advancement in Language Technology (AsLing) is delighted to announce the forthcoming 39th edition of the annual Translating and the Computer Conference (TC39) on 16 and 17 November 2017 in London.

The TC conference series has emerged as a leading forum for users, developers and vendors of Translation Technology tools. It is a distinctive event where translators, interpreters, researchers and business people, from translation companies, international organisations, universities and research labs, as well as freelance professionals, come together to exchange ideas and learn about the latest developments in translation and interpretation technologies.

TC conferences feature presentations and posters, panel discussions and workshops. In addition, the conference welcomes two distinguished experts as keynote speakers. In 2017 our keynotes will be given by Roberto Navigli (Universit? degli Studi di Roma "La Sapienza") and Alex Waibel (Carnegie Mellon University, Pittsburgh, and Karlsruher Institut f?r Technologie).

This call invites submissions of extended abstracts for papers, posters and workshop proposals to be given at the TC39 conference.


Conference topics

CAT tools (e.g. Translation Memory systems), Terminology Management tools, Machine Translation (e.g. statistical MT, neural MT, rule-based MT). We particularly welcome contributions addressing these technologies in relation to the following themes: appropriate use, quality assessment and quality control, post-editing, customisation, interoperability and integration of different systems and tools, crowd-sourcing, Natural Language Processing for translation and interpreting, as well as translation and interpreting workflow and management. Other important topics include training (including university translation programmes and the effect of rapid change in the translation industry), tools and resources for interpreters and translators, enhancing collaboration between translators and translation companies, and mobile technologies to support translators? and interpreters' work.

The emphasis of TC39 will be on the new and emerging language technologies, tools and resources which can support the work of interpreters.

Submission guidelines

Proposals for original unpublished papers and posters on all aspects of translation and interpretation technologies are invited, as well as workshop proposals. Papers and posters may report on research, commercial translation and interpretation products or user experience. Proposals should be submitted via the START conference submission system at https://www.softconf.com/i/tc2017 or, in exceptional circumstances and subject to prior confirmation by the conference organisers, may be sent by email to <submissions@asling.org>. Further information is available on the Conference website at http://www.asling.org/tc39<http://www.asling.org/tc39/?page_id=219>.





Schedule

The conference schedule is as follows:

15 June 2017 - deadline for abstracts of papers and posters and workshop proposals
1 August 2017 - all authors notified of decisions
2 October 2017 - speakers' full papers and posters to be submitted for inclusion in the e-proceedings
3 November 2017 ? speakers? presentations to be submitted
16-17 November 2017 - conference takes place in London

Organising and Programme Committees



Organising Committee:

Joanna Drugan, University of East Anglia

Jo?o Esteves-Ferreira, Tradulex, International Association for Quality Translation (conference chair)

Juliet Macan, Language technology consultant (conference chair)

Ruslan Mitkov, University of Wolverhampton (conference chair)

Olaf-Michael Stefanov, United Nations (ret), JIAMCATT (conference chair)

Jean-Marie Vande Walle, Belgian Court of Auditors

Programme Committee:
Anne Aboh Dauvergne, United Nations
Juanjo Arevallillo, Hermes Traducciones and Universidad Alfonso X el Sabio de Madrid
Wilker Aziz, Universiteit van Amsterdam
Sheila Castilho, Adapt Centre, Dublin City University
David Chambers, AsLing Honorary Member
Eleanor Cornelius, University of Johannesburg and FIT Council
Gloria Corpas Pastor, Universidad de M?laga
David Filip, CNGL / ADAPT ? Trinity College, Dublin
Sarah Griffin-Mason, Institute of Translation and Interpreting and University of Portsmouth
Camelia Ignat, Research Centre of the European Union
Joss Moorkens, Dublin City University
Bruno Pouliquen, World Intellectual Property Organization
Antonio Toral, Rijksuniversiteit Groningen
Paola Valli, Taus and Universit? degli Studi di Trieste
Nelson Ver?stegui, International Telecommunications Union (ret)
David Verhofstadt, International Atomic Energy Agency

Further information and contact details

Registration fees and other relevant information about venue, accommodation, conference sponsors and keynote speakers will be published on the Conference website as soon as they are available.

For further information, please email <tc39-info@asling.org<mailto:tc39-info@asling.org>>.


AsLing
Association internationale pour la promotion des technologies linguistiques
International Association for Advancement in Language Technology






Richard Evans,
Research Associate,
Computational Linguistics Research Group,
Research Institute of Information and Language Processing,
University of Wolverhampton,
United Kingdom.
http://rgcl.wlv.ac.uk/~richard
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20170512/69f99032/attachment.html

------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


End of Moses-support Digest, Vol 127, Issue 16
**********************************************

0 Response to "Moses-support Digest, Vol 127, Issue 16"

Post a Comment