Moses-support Digest, Vol 85, Issue 5

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."


Today's Topics:

1. Re: Postgraduate Student Internship Oracle - Update
(Sorin Slavescu)
2. Re: -lm training parameter (Tom Hoar)
3. Re: [Question?]How to ensure that all N-best list will be
included in Moses search graph? (LUONG Ngoc Quang)
4. Syntax model in source side (burak ayd?n)


----------------------------------------------------------------------

Message: 1
Date: Mon, 04 Nov 2013 10:28:17 +0000
From: Sorin Slavescu <sorin.slavescu@oracle.com>
Subject: Re: [Moses-support] Postgraduate Student Internship Oracle -
Update
To: moses-support@mit.edu
Message-ID: <527776C1.2010805@oracle.com>
Content-Type: text/plain; charset="utf-8"

Forgot to mention that the Internship starts 2nd of June 2014 for 13 months.

Apologies,
Sorin

On 01/11/13 11:01, Sorin Slavescu wrote:
>
> Worldwide Product Translation Group in Oracle has an opening for an
> internship position in Machine Translation for a Masters or PhD
> student. The internship is available for applied research in
> translation quality improvement for a Moses based SMT system. Projects
> in the areas of incremental training, preprocessing for English to
> Japanese or German, confidence estimation, domain specific MT engines,
> Moses function features for terminology integration, or other similar
> research areas are considered.
>
> Knowledge in SMT and previous experience using Moses toolkit are required.
> Programming skills required: C++ necessary, Java and Perl an advantage.
>
> The internship position is for 13 months with a competitive salary
> plus benefits. The successful candidate will be required to work from
> our Dublin office locatedin EastPointBusinessPark,Dublin3.
>
> Please submit your CV and a description of the MT research project(s)
> you are involved in to sorin.slavescu@oracle.com
>
> Closing date: Monday 11th of November 2013.
>
> --
>
>
> ORACLE <http://www.oracle.com>
> Sorin Slavescu | Principal Software Engineer
> Phone: +35318031937 | E-mail: sorin.slavescu@oracle.com
> Oracle Worldwide Product Translation (WPTG) - Tools
> Block P5, East Point Business Park | Dublin 3, Ireland
> Oracle is committed to developing practices and products that help
> protect the environment <http://www.oracle.com/commitment>
>
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support

--


ORACLE <http://www.oracle.com>
Sorin Slavescu | Principal Software Engineer
Phone: +35318031937 | E-mail: sorin.slavescu@oracle.com
Oracle Worldwide Product Translation (WPTG) - Tools
Block P5, East Point Business Park | Dublin 3, Ireland
Oracle is committed to developing practices and products that help
protect the environment <http://www.oracle.com/commitment>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20131104/2406e698/attachment-0001.htm

------------------------------

Message: 2
Date: Mon, 04 Nov 2013 19:48:39 +0700
From: Tom Hoar <tahoar@precisiontranslationtools.com>
Subject: Re: [Moses-support] -lm training parameter
To: moses-support@mit.edu
Message-ID: <527797A7.5040804@precisiontranslationtools.com>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

Yes, on both counts. You can edit the moses.ini file to change to a
different LM. Editing the train-model.perl script should work. We take a
different approach. We create a temporary /tmp/placeholder.lm before
running the script and then remove it afterwards. We then regex the
pattern and change the moses.ini file to any LM we want.


On 11/04/2013 04:57 AM, Read, James C wrote:
> Thanks.
>
> So if you wanted to train and at a later date use a different LM with the already trained TM would it just be a simple case of manually editing moses.ini?
>
> If I were to edit the training script to skip the check that LM file exists (it doesn't) it wouldn't break anything would it?
>
> James
>
> ________________________________________
> From:moses-support-bounces@mit.edu [moses-support-bounces@mit.edu] on behalf of Tom Hoar [tahoar@precisiontranslationtools.com]
> Sent: 03 November 2013 13:03
> To:moses-support@mit.edu
> Subject: Re: [Moses-support] -lm training parameter
>
> You are correct that train-model.perl script does not use the -lm
> parameter through any of the word alignment or phrase scoring steps. The
> script's step 9 builds a template moses.ini configuration file and
> includes the values from the -lm parameter. At the beginning, the script
> checks that the -lm value points to a non-zero length file. If the file
> is missing or is zero length, the script halts.
>
>
>
> On 11/03/2013 06:03 PM, Read, James C wrote:
>> Hi,
>>
>> does anybody know what the effect of the -lm training parameter in the training script is? Surely the LM used has no effect on typical training tasks like word alignment and phrase scoring?
>>
>> thanks,
>> James
>>
>> _______________________________________________
>> Moses-support mailing list
>> Moses-support@mit.edu
>> http://mailman.mit.edu/mailman/listinfo/moses-support
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support



------------------------------

Message: 3
Date: Mon, 4 Nov 2013 14:01:04 +0100
From: LUONG Ngoc Quang <quangngocluong@gmail.com>
Subject: Re: [Moses-support] [Question?]How to ensure that all N-best
list will be included in Moses search graph?
To: Hieu Hoang <Hieu.Hoang@ed.ac.uk>
Cc: moses-support <moses-support@mit.edu>, "benjamin.lecouteux@imag.fr
Lecouteux" <benjamin.lecouteux@imag.fr>
Message-ID: <8EA1D5FF-B251-4C7C-ADEF-10311B62A655@gmail.com>
Content-Type: text/plain; charset="windows-1252"

Hi,
I've just sent you all the resources needed but i got an notification saying that my email is now being held awaiting the moderator approval (my attachment is about 13 MB). Can you approve this email or suggest me another sharing method?
Thanks so much,
bests,
quang




On Nov 4, 2013, at 1:27 AM, Hieu Hoang <Hieu.Hoang@ed.ac.uk> wrote:

> can you please send me the output search graph and the n-best list for the 1 sentence where you are seeing this error.
>
> Also, please send me the script you use to check whether a sentence is in the graph.
>
> I don't think it is possible for a sentence to be in the nbest list but not in the search graph, but i will take a look
>
>
> On 1 November 2013 14:17, LUONG Ngoc Quang <quangngocluong@gmail.com> wrote:
> Hi all,
> Currently i would like to exploit the Moses search graph as an alternative for N-best list to improve MT, thanks to a huge number of hypotheses it contains. Nevertheless when i try to output all the hypotheses in the search graph, i observe that they don't cover all N-best list. In other words, there are still several hypotheses in N-best (N=1000) not included in the search graph.
> Even when i increase the number of hypotheses popped for each stack (up to 10000 for example), this coverage can not be obtained.
> So, i would like to ask all of you whether we have any Moses option (command, script?) to ensure that the search graph will contain all N best list of each source sentence?
> For information, the command we invoked to generate the search graph and the N-best list is as follows:
>
> ./home/toolkits/Moses/moses-04-2009/moses-cmd/src/moses
> -f output10881/moses.ini
> -mbr -cube-pruning-pop-limit 10000
> -s 10000
> -output-search-graph search_graph_881
> -use-alignment-info
> -print-alignment-info
> -include-alignment-in-n-best
> -print-alignment-info-in-n-best -n-best-list ./Nbest-881test 1000
> < ./pre-processed-corpus/881.lowercased.fr
> > ./pre-processed-corpus/881.hyp.SG
>
> Furthermore, the way we conduct to extract all hypotheses (sentences) from the search graph is : We start from each final node, move backward until reach the initial one. First, the hypothesis is set : s= "". In each step, we get the field "out" of the current node (the translated word(s)) and add to the hypothesis : s = out + s.
> Is this the right way to obtain all hypotheses?
>
> Thanks a lot and all suggestions would be very welcomed !
> bests,
> Quang
>
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
>
> --
> Hieu Hoang
> Research Associate
> University of Edinburgh
> http://www.hoang.co.uk/hieu
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20131104/350fc02f/attachment-0001.htm

------------------------------

Message: 4
Date: Mon, 4 Nov 2013 15:53:51 +0200
From: burak ayd?n <baydinx@gmail.com>
Subject: [Moses-support] Syntax model in source side
To: moses-support@mit.edu
Message-ID:
<CAH+r-S+X-BcnCTmDWEod6Qy1+w__SFmdOR74HvaNm=y3N00guQ@mail.gmail.com>
Content-Type: text/plain; charset="iso-8859-1"

Hi everyone,

I want to use Collins parser while translating from En. I checked the
sample ems configs and applied it. The experiment did not crash or get any
error, but bleu scores were dramatically low, implying that there must be
something wrong. Here the additional parameters for sytnax with Collins' :

#syntactic parsers
input-parser = "$moses-script-dir/training/wrappers/parse-en-collins.perl
-collins /usr/local/smt/COLLINS-PARSER -mxpost /usr/local/smt/MXPOST/ "

#training options
training-options = "-mgiza -mgiza-cpus 4 -sort-buffer-size 8G
-sort-compress gzip -sort-parallel 4 -cores 4 -source-syntax"

Do I need additional parameters except the ones above? I would appreciate
any help.

Thanks
Burak
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20131104/668794b7/attachment.htm

------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


End of Moses-support Digest, Vol 85, Issue 5
********************************************

0 Response to "Moses-support Digest, Vol 85, Issue 5"

Post a Comment