Moses-support Digest, Vol 113, Issue 63

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."


Today's Topics:

1. Training a Moses translation system on multiple cores
(Sa?o Kuntaric)
2. Re: Training a Moses translation system on multiple cores
(Rajen Chatterjee)
3. Final Call For Papers: Workshop on Corpora and Tools for
Processing Corpora, Tomar, Portugal (Hilario Leal Fontes)


----------------------------------------------------------------------

Message: 1
Date: Wed, 23 Mar 2016 23:03:08 +0100
From: Sa?o Kuntaric <saso.kuntaric@gmail.com>
Subject: [Moses-support] Training a Moses translation system on
multiple cores
To: moses-support@mit.edu
Message-ID:
<CANsquDoqKCWaLLDb1mUnW3ZAx+tcSto5zJwsPbTdE66txt65hQ@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Hi all,

I am trying to train a language system on a server with 2x8 cores. The
problem is that no matter how I add an argument for multiple cores, the
system states it cannot recognize the command. I have tried:

nohup nice ~/mosesdecoder/scripts/training/train-model.perl -root-dir train
-corpus ~/corpus/Individual/combined.clean -f en -e sl -alignment
grow-diag-final-and -reordering msd-bidirectional-fe -lm
0:3:$HOME/corpus/Individual/corpus.blm.sl:8 -external-bin-dir
~/mosesdecoder/tools >& training.out & --decoder-flags="-threads 8"

and

nohup nice ~/mosesdecoder/scripts/training/train-model.perl -root-dir train
-corpus ~/corpus/Individual/combined.clean -f en -e sl -alignment
grow-diag-final-and -reordering msd-bidirectional-fe -lm
0:3:$HOME/corpus/Individual/corpus.blm.sl:8 -external-bin-dir
~/mosesdecoder/tools >& training.out & -cores

Any ideas what I am doing wrong?

Best regards,

Sa?o
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20160323/a206382c/attachment-0001.html

------------------------------

Message: 2
Date: Thu, 24 Mar 2016 10:25:22 +0100
From: Rajen Chatterjee <rajen.k.chatterjee@gmail.com>
Subject: Re: [Moses-support] Training a Moses translation system on
multiple cores
To: Sa?o Kuntaric <saso.kuntaric@gmail.com>
Cc: moses-support@mit.edu
Message-ID:
<CAC4-+NyuBBBn39QJHKGu6Thi8KuBunW0K2HYApg10aMg-JfD+Q@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Hi, you are specifying the parameter in wrong place.
Change the last part of your command from
... >& training.out & --cores to
... --cores 8 >& training.out &
On Mar 23, 2016 11:04 PM, "Sa?o Kuntaric" <saso.kuntaric@gmail.com> wrote:

> Hi all,
>
> I am trying to train a language system on a server with 2x8 cores. The
> problem is that no matter how I add an argument for multiple cores, the
> system states it cannot recognize the command. I have tried:
>
> nohup nice ~/mosesdecoder/scripts/training/train-model.perl -root-dir
> train -corpus ~/corpus/Individual/combined.clean -f en -e sl -alignment
> grow-diag-final-and -reordering msd-bidirectional-fe -lm
> 0:3:$HOME/corpus/Individual/corpus.blm.sl:8 -external-bin-dir
> ~/mosesdecoder/tools >& training.out & --decoder-flags="-threads 8"
>
> and
>
> nohup nice ~/mosesdecoder/scripts/training/train-model.perl -root-dir
> train -corpus ~/corpus/Individual/combined.clean -f en -e sl -alignment
> grow-diag-final-and -reordering msd-bidirectional-fe -lm
> 0:3:$HOME/corpus/Individual/corpus.blm.sl:8 -external-bin-dir
> ~/mosesdecoder/tools >& training.out & -cores
>
> Any ideas what I am doing wrong?
>
> Best regards,
>
> Sa?o
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20160324/5ed728a8/attachment-0001.html

------------------------------

Message: 3
Date: Thu, 24 Mar 2016 11:04:01 +0100
From: Hilario Leal Fontes <h.lealfontes@gmail.com>
Subject: [Moses-support] Final Call For Papers: Workshop on Corpora
and Tools for Processing Corpora, Tomar, Portugal
To: moses-support@mit.edu
Message-ID:
<CAPxkkqNJdw_-kwBUkGNGYxoFDqMASHKzNEuynJFcararUvsAPw@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Final Call For Papers

[apologies for cross-postings]

=====================================================

Corpora and Tools for Processing Corpora
http://propor2016.di.fc.ul.pt/?page_id=383



***NEW DATE*** July 13, 2016 ? Tomar, Portugal

Workshop co-located with PROPOR 2016
http://propor2016.di.fc.ul.pt/



Motivation

A great deal of the popularity of statistical machine translation solutions
is due to the availability of software packages that are making
increasingly easier and faster to train a working machine translation
system. For this deployment to take place, these packages have been seen as
just requiring to be fed with a sufficiently large volume of data,
including some form of parallel corpora of raw text.

While advances in ever more sophisticated aspects of language technology
have permitted this to become increasingly feasible, it has been left in
the shadow the fact that the data needed to feed these systems still
require a considerable deal of preparation. Given the volume of appropriate
corpora needed, this preparation can only be practical if suitable datasets
are available, on the one hand; and, on the other hand, if this preparation
is supported by a number of shallow processing tools, such as boilerplate
removers, tokenisers, orthographic normalisers, hyphenators, foreign word
detectors, inflectional analysers, etc.

While the construction of this type of tools is no longer a hot topic for
cutting-edge research in language technology, resorting to them may turn
out to be in many cases less easy than finding and using the much more
sophisticated modules needed to deploy the machine translation systems.
This is a specially acute situation when it comes to the vast majority of
languages, which are comparatively less resourced than English in terms of
language technology, and it comes to tools performing at the state of the
art level and furthermore are openly available to be reused.

It goes without saying that these negative circumstances go on par with and
get aggravated by the fact that suitable parallel texts are not available
or easy to obtain. Interestingly, many times such tools and datasets exist
and yet their development has never been documented in a publication or
their availability has never been disseminated.



Aims

The present workshop seeks to contribute to improve on this state of
affairs by helping to map both available parallel datasets suitable to feed
statistical machine translation systems and available language processing
tools useful for their preparation.

While pursuing this goal, the workshop seeks also to exchange ideas and
disseminate best practices that help to foster the ELRC and CEF.AT
<http://cef.at/> (http://www.lr-coordination.eu) initiatives.



Call

We thus invite submissions reporting on language resources suitable to
support statistical machine translation from/into Portuguese and on
processing tools for their preparation. Different types of presentations
are possible, under the form of an oral presentation and/or of a
demonstration. While the workshop seeks to attract and promote papers
concerning language resources and tools not yet documented in previous
publications, for the sake of encompassing representativeness, renewed
papers on the other tools and resources are also welcome.



Formats

The submissions should be in the .pdf file format, should not exceed 8
pages, and should use the article template that can be found here:
http://www.springer.com/computer/lncs?SGWID=0-164-6-793341-0 (consider
sections under header ?CS Proceedings and Other Multiauthor Volumes?).
Papers shall be submitted via the EasyChair online platform:
https://www.easychair.org/conferences/conference_change_yes.cgi?a=10930112.



Publication

Accepted papers will be published in a special issue of the journal of the
Portuguese Language Department of the Directorate-General of Translation of
the European Commission, freely available online.



Fees

The participation in the workshop, for authors or non-authors of papers
alike, is free of charge. The organization of the workshop is supported by
the Portuguese Language Department of the Directorate-General of
Translation of the European Commission.



Language

The workshop invites submissions on resources and tools for any language
that fit into the stated aim of this workshop. English is the working
language for submissions and in the workshop.



Dates

February 25: First call for papers

March 24: Final call for papers

April 15: Deadline for submissions

May 16: Notification sent to authors

June 1: Camera-ready papers ready

JULY 13, 2016: Workshop takes place



Organization Committee

Hil?rio Leal Fontes, DGT ? European Commission (chair)

Paulo Batista, DGT ? European Commission

Ant?nio Branco, University of Lisbon



Programme Committee

Ant?nio Branco, University of Lisbon (co-chair)

Hil?rio Leal Fontes, European Commission (co-chair)

Alexandru Ceausu, AMPLEXOR Luxembourg

Aline Villavicencio, Universidade Federal do Rio Grande do Sul

Am?lia Mendes, Centro de Lingu?stica da Universidade de Lisboa

Belinda Maia, Universidade do Porto

Francis Tyers, Universitetet i Troms?

Gabriel Lopes, Faculdade de Ci?ncias e Tecnologia, UNL

Gorka Labaka, University of the Basque Country

Jorge Baptista, CECL/U. Algarve and L2F-Spoken Language Lab/INESC ID Lisboa

Jos? Ramom Pichel Campos, imaxin|software

Lu?s Trigo, LIAAD-INESC Porto L.A.

Lu?sa Coheur, IST/INESC-ID Lisboa

M.T. Carrasco Benitez, European Commission

Maria Jos? Machado, European Commission

Michael Jellinghaus, European Commission

Mikel Forcada, DLSI ? Universitat d?Alacant

Paulo Quaresma, Universidade de ?vora

Paulo Correia, European Commission

Thiago Pardo, Universidade de S?o Paulo

Xavier G?mez Guinovart, Universidade de Vigo



Contacts:

Hil?rio Leal Fontes, hilario.fontes@ec.europa.eu
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20160324/74e402be/attachment.html

------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


End of Moses-support Digest, Vol 113, Issue 63
**********************************************

0 Response to "Moses-support Digest, Vol 113, Issue 63"

Post a Comment