Send Moses-support mailing list submissions to
moses-support@mit.edu
To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu
You can reach the person managing the list at
moses-support-owner@mit.edu
When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."
Today's Topics:
1. Re: Incremental training after align_new.sh (prajdabre)
2. Fwd: Moses-support post from sarah.schulz@ugent.be requires
approval (Hieu Hoang)
3. missing .gz files at the end of mgiza (Vito Mandorino)
----------------------------------------------------------------------
Message: 1
Date: Fri, 20 Feb 2015 12:19:21 +0900
From: prajdabre <prajdabre@gmail.com>
Subject: Re: [Moses-support] Incremental training after align_new.sh
To: ?franky <wanghaozhou@hotmail.com>, "moses-support@mit.edu"
<moses-support@mit.edu>
Message-ID: <kh7qq26ab8363q683gx936u0.1424402361102@email.android.com>
Content-Type: text/plain; charset="utf-8"
Hello,
After force alignment is done you get some files which contain alignments and some which contain the counts and probabilities.?
You should only consider the ones containing alignments.
For example there will be a file like: *.A3.final generated by align_new.sh which shows which word is aligned to which in every sentence. You must append its contents to the corresponding file in generated by the initial training (full_train.sh).
Hope this is clear.
Sent from Samsung Mobile
-------- Original message --------
From: ?franky <wanghaozhou@hotmail.com>
Date: 20/02/2015 00:05 (GMT+09:00)
To: moses-support@mit.edu
Subject: [Moses-support] Incremental training after align_new.sh
Hello everyone
?
When I using align_new.sh for the incremental training, I can?t understand ?append the forward alignments (fwd) generated by align_new.sh to the forward (fwd) alignments generated by full_train.sh? witch is written in your mail. For me, the files generated by align_new.sh such as ?test.new.en_test.new.fr.dict.p0_3.final? cannot be appended directly to the files generated by full_train.sh. Could some one give me a clearer explication?
?
Thanks
?
Best Regards
?
?
Tortue WANG
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20150220/96a5b68f/attachment-0001.htm
------------------------------
Message: 2
Date: Fri, 20 Feb 2015 10:44:15 +0000
From: Hieu Hoang <hieuhoang@gmail.com>
Subject: [Moses-support] Fwd: Moses-support post from
sarah.schulz@ugent.be requires approval
To: sarah.schulz@ugent.be, moses-support <moses-support@mit.edu>
Message-ID: <54E70FFF.6070000@gmail.com>
Content-Type: text/plain; charset="windows-1252"
please subscribe to the Moses mailing list. You can subscribe here:
http://mailman.mit.edu/mailman/listinfo/moses-support
to answer your question:
./moses -feature-add "PhraseDictionaryCompact path=..." "KENLM
path=..." -weight-add "PhraseDictionaryCompact0= 0.1 0.2 0.3 0.4" "LM0=
0.7"
-------- Forwarded Message --------
Subject: Moses-support post from sarah.schulz@ugent.be requires approval
Date: Fri, 20 Feb 2015 05:32:55 -0500
From: moses-support-owner@mit.edu
To: moses-support-owner@mit.edu
As list administrator, your authorization is requested for the
following mailing list posting:
List: Moses-support@mit.edu
From: sarah.schulz@ugent.be
Subject: Command line switch for LM and ttable
Reason: Post by non-member to a members-only list
At your convenience, visit:
http://mailman.mit.edu/mailman/admindb/moses-support
to approve or deny the request.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20150220/0b1ce852/attachment-0001.htm
-------------- next part --------------
An embedded message was scrubbed...
From: Sarah Schulz <Sarah.Schulz@UGent.be>
Subject: Command line switch for LM and ttable
Date: Fri, 20 Feb 2015 11:32:51 +0100
Size: 3926
Url: http://mailman.mit.edu/mailman/private/moses-support/attachments/20150220/0b1ce852/attachment-0002.eml
-------------- next part --------------
An embedded message was scrubbed...
From: moses-support-request@mit.edu
Subject: confirm ec261b3fbfdb59c1c2b65dd6504d04b3f4d950b2
Date: no date
Size: 631
Url: http://mailman.mit.edu/mailman/private/moses-support/attachments/20150220/0b1ce852/attachment-0003.eml
------------------------------
Message: 3
Date: Fri, 20 Feb 2015 12:18:08 +0100
From: Vito Mandorino <vito.mandorino@linguacustodia.com>
Subject: [Moses-support] missing .gz files at the end of mgiza
To: moses-support <moses-support@mit.edu>
Message-ID:
<CA+8mSmEU5sp0Cq6h2+7xeft9CzWC1unTqc4-t3hrrTZbOHA8PA@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"
Dear All,
I am training a model with placeholders from French to English and the
process ends before the end of training step 2, without creating the
en-fr.AR.final.gz and fr-en.AR.final.gz files in the respective
folders giza.en-fr and giza.fr-en .
I cannot understand why. Here's the last 7 lines of the training.out file:
Entire Viterbi H333444 Training took: 78424 seconds
==========================================================
Entire Training took: 134751 seconds
Program Finished at: Fri Feb 20 05:45:19 2015
==========================================================
and here's the command used for training:
nohup nice /home/Moses/mosesdecoder/scripts/training/train-model.perl \
--parallel \
-mgiza -mgiza-cpus 20 \
-root-dir /home/BRIQUES/train-leclerc-light-ph-fren/training \
-external-bin-dir /root/external-bin-dir/ \
-corpus
/home/BRIQUES/train-leclerc-light-ph-enfr/data/Corpus.leclerc-ph.cleanclean
-f fr -e en \
-alignment grow-diag-final-and -reordering msd-bidirectional-fe \
-lm 0:5:/home/DATA/LM/RapAnSoc/lm.RapAnSoc5-ph.blm.en.mm:8 \
-write-lexical-counts \
-extract-options '--Placeholders @num@,@rom@,@alpha@' \
>& /home/BRIQUES/train-leclerc-light-ph-fren/training/training.out &
It seems a bit odd to me that the analogous training in the reversed
English-French direction has worked nicely.
In fact, in this case the folders giza.en-fr and giza.fr-en do contain not
only the .gz files but also two more "chunks" en-fr.A3.final.partD and
en-fr.A3.final.partE, whereas the French-English training stops at
fr-en.A3.final.partC.
Thank you,
Vito Mandorino
--
[image: Description : Description : lingua_custodia_final full logo]
*The Translation Trustee*
*1, Place Charles de Gaulle, **78180 Montigny-le-Bretonneux*
*Tel : +33 1 30 44 04 23 Mobile : +33 6 84 65 68 89*
*Email :* *vito.mandorino@linguacustodia.com
<massinissa.ahmim@linguacustodia.com>*
*Website :* *www.linguacustodia.com <http://www.linguacustodia.com/> -
www.thetranslationtrustee.com <http://www.thetranslationtrustee.com/>*
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20150220/e79484fd/attachment.htm
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image001.jpg
Type: image/jpeg
Size: 4421 bytes
Desc: not available
Url : http://mailman.mit.edu/mailman/private/moses-support/attachments/20150220/e79484fd/attachment.jpg
------------------------------
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support
End of Moses-support Digest, Vol 100, Issue 68
**********************************************
Subscribe to:
Post Comments (Atom)
0 Response to "Moses-support Digest, Vol 100, Issue 68"
Post a Comment