Send Moses-support mailing list submissions to
moses-support@mit.edu
To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu
You can reach the person managing the list at
moses-support-owner@mit.edu
When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."
Today's Topics:
1. Re: Fwd: Moses-support post from
shachar.mirkin@xrce.xerox.com requires approval (Ulrich Germann)
2. Multiple reordering models while creating ini at step 9
(jian zhang)
3. Re: Failed to compile moses (Hieu Hoang)
----------------------------------------------------------------------
Message: 1
Date: Tue, 18 Mar 2014 18:50:43 +0000
From: Ulrich Germann <ulrich.germann@gmail.com>
Subject: Re: [Moses-support] Fwd: Moses-support post from
shachar.mirkin@xrce.xerox.com requires approval
To: "Mirkin, Shachar" <shachar.mirkin@xrce.xerox.com>
Cc: Hieu Hoang <hieu.hoang@ed.ac.uk>, moses-support
<moses-support@mit.edu>
Message-ID:
<CAHQSRUqJGvCrROv795hTN0-rmt6SF=T0+9tLtRVK7GoFOurEPw@mail.gmail.com>
Content-Type: text/plain; charset="iso-8859-1"
PhraseDictionaryDynSuffixArray is deprecated and should not be used any
more. It will be replaced with memory-mapped suffix array phrase tables
(mmsapt) which are currently in the branch dynamic-phrase-tables.
In order to use them, you need:
- the two text files, one sentence per line
- the word alignments in symal format
let fr be the language tag for the language you are translating from and en
the tag for the language we are translating to
cat train.fr | mtt-build -i -o train.fr
cat train.en | mtt-build -i -o train.en
cat train.symal | symal2mam train.fr-en.mam
mmlex-build train fr en -o train.fr-en.lex -c train.fr-en.coc
then in moses.ini, the line for the phrase table should look like this:
Mmsapt name=PT0 output-factor=0 num-features=5 base=/path/to/train L1=fr
L2=en
No guarantee that this works; this is work in progress. Probably won't work
on Mac, and works in multi-threaded mode only.
- Uli
On Mon, Mar 17, 2014 at 4:17 PM, Mirkin, Shachar <
shachar.mirkin@xrce.xerox.com> wrote:
> Hi,
>
> I'm now subscribed also from this email address.
>
> Let me give more details about the problems that I encountered.
> Trying to load the Moses server with the modified ini file, after
> replacing the PhraseDictionaryBinary line with:
>
> PhraseDictionaryDynSuffixArray source=<path-to-source-corpus> target=<path-to-target-corpus> alignment=<path-to-alignments>
>
> (with the correct paths, of course), I got:
>
> Feature function PhraseDictionaryDynSuffixArray0 specified 1 dense scores
> or weights. Actually has 0
>
> This was solved by adding "num-features=0" to the
> PhraseDictionaryDynSuffixArray line.
>
> The next error was:
>
> ...
> Loading source corpus...
> terminate called after throwing an instance of
> 'Moses::StrayFactorException'
> what(): moses/Word.cpp:112 in void
> Moses::Word::CreateFromString(Moses::FactorDirection, const
> std::vector<long unsigned int, std::allocator<long unsigned int> >&, const
> StringPiece&, bool) threw StrayFactorException because `fit'.
> You have configured 0 factors but the word le contains factor delimiter |
> too many times.
>
> In this test my source, target and alignment files consist each of a
> single line with no "|"s, and the word "le" is the first one in the source.
>
> Is there anything else I should do in the ini file?
>
> Thanks,
> Shachar
>
>
>
>
> On 03/17/2014 02:58 PM, Hieu Hoang wrote:
>
> Hi Shachar
>
> can you please subscribe to the mailing list before posting to it. It's
> a public email address so there's a lot of automated spammers. You can
> subscribe here
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
> To answer you question - the webpage does document it in the new ini
> format, eg.
> PhraseDictionaryDynSuffixArray source=<path-to-source-corpus> ...
> Do you have a printout of the old version?
>
> Also, the dynamic suffix array is undergoing updates as Uli Germann
> (cc'ed) is updating it with more features. He can tell you more about it
>
>
> ---------- Forwarded message ----------
> From: <moses-support-owner@mit.edu>
> Date: 17 March 2014 12:13
> Subject: Moses-support post from shachar.mirkin@xrce.xerox.com requires
> approval
> To: moses-support-owner@mit.edu
>
>
> As list administrator, your authorization is requested for the
> following mailing list posting:
>
> List: Moses-support@mit.edu
> From: shachar.mirkin@xrce.xerox.com
> Subject: Incremental training and the new ini format
> Reason: Post by non-member to a members-only list
>
> At your convenience, visit:
>
> http://mailman.mit.edu/mailman/admindb/moses-support
>
> to approve or deny the request.
>
>
> ---------- Forwarded message ----------
> From: "Mirkin, Shachar" <shachar.mirkin@xrce.xerox.com>
> To: moses-support@mit.edu
> Cc:
> Date: Mon, 17 Mar 2014 13:06:47 +0100
> Subject: Incremental training and the new ini format
> Hi,
>
> I'm trying to use incremental training with the latest Moses version, but
> the documentation refers to the old ini format (
> http://www.statmt.org/moses/?n=Moses.AdvancedFeatures#ntoc34).
> Can you please explain what changes are required to get the incremental
> training working with the new ini format?
>
> Thanks,
> Shachar
>
>
>
>
> ---------- Forwarded message ----------
> From: moses-support-request@mit.edu
> To:
> Cc:
> Date:
> Subject: confirm 2701c5fb8f659b6037c9e0bf07ad70095ba4ffe2
> If you reply to this message, keeping the Subject: header intact,
> Mailman will discard the held message. Do this if the message is
> spam. If you reply to this message and include an Approved: header
> with the list password in it, the message will be approved for posting
> to the list. The Approved: header can also appear in the first line
> of the body of the reply.
>
>
>
> --
> Hieu Hoang
> Research Associate
> University of Edinburgh
> http://www.hoang.co.uk/hieu
>
>
>
--
Ulrich Germann
Research Associate
School of Informatics
University of Edinburgh
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20140318/dca628c5/attachment-0001.htm
------------------------------
Message: 2
Date: Tue, 18 Mar 2014 22:20:57 +0000
From: jian zhang <zhangj@computing.dcu.ie>
Subject: [Moses-support] Multiple reordering models while creating ini
at step 9
To: moses-support@mit.edu
Message-ID:
<CALA=z0C11sbnMLw4Ta3OVMsFZE7LyF+LZkpL-gTp_FGP4_-mkA@mail.gmail.com>
Content-Type: text/plain; charset="iso-8859-1"
Hi,
Just a bug, i think
Line 2011 at
https://github.com/moses-smt/mosesdecoder/blob/master/scripts/training/train-model.perl
.
The $i++ should be placed into the loop while creating multiple reordering
models, otherwise all reordering models will have index 0.
Jian Zhang
--
Jian Zhang
Centre for Next Generation Localisation (CNGL)<http://www.cngl.ie/index.html>
Dublin City University <http://www.dcu.ie/>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20140318/2faabfa8/attachment-0001.htm
------------------------------
Message: 3
Date: Wed, 19 Mar 2014 00:07:34 +0000
From: Hieu Hoang <Hieu.Hoang@ed.ac.uk>
Subject: Re: [Moses-support] Failed to compile moses
To: steven.xu@lba.ca
Cc: moses-support <moses-support@mit.edu>
Message-ID:
<CAEKMkbjP0mJyN4KTbvgbaN4F-HaetEFfBx0DoS30stvOKYfcXQ@mail.gmail.com>
Content-Type: text/plain; charset="iso-8859-1"
you have to run
./bjam --with-boost=/root/Desktop/boost_1_55_0 -j8
not
./jam-files/bjam ....
On 17 March 2014 20:33, Steven Xu <steven.xu@lba.ca> wrote:
> Hi, All
>
> I have problem to compile moses. I followed exactly the steps described in
> the online document( http://www.statmt.org/moses/?
> n=Development.GetStarted):
>
> 1) I compiled boost successfully.
> wget http://downloads.sourceforge.net/project/boost/boost/1.55.
> 0/boost_1_55_0.tar.gz?r=http%3A%2F%2Fsourceforge.net%
> 2Fprojects%2Fboost%2Ffiles%2Fboost%2F1.55.0%2F&ts=
> 1389613041&use_mirror=kent
> tar zxvf boost_1_55_0.tar.gz
> cd boost_1_55_0/
> ./bootstrap.sh
> ./b2 -j8 --prefix=$PWD --libdir=$PWD/lib64 --layout=tagged link=static
> threading=multi,single install || echo FAILURE
>
> 2) I run into probem at
> ./bjam --with-boost=/root/Desktop/boost_1_55_0 -j8
>
> This is the command I collected the logs:
> ./jam-files/bjam --with-boost=/root/Desktop/boost_1_55_0 -j8
> --debug-configuration -d2 |gzip >build.log.gz
>
> Thanks for your answer.
>
> Steven Xu
>
> --
> Thanks
> Steven Xu
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
--
Hieu Hoang
Research Associate
University of Edinburgh
http://www.hoang.co.uk/hieu
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20140319/22b13ccd/attachment.htm
------------------------------
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support
End of Moses-support Digest, Vol 89, Issue 41
*********************************************
Subscribe to:
Post Comments (Atom)
0 Response to "Moses-support Digest, Vol 89, Issue 41"
Post a Comment