Moses-support Digest, Vol 142, Issue 3

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."


Today's Topics:

1. Re: Issues running MGiza on AWS machine (James Baker)


----------------------------------------------------------------------

Message: 1
Date: Wed, 1 Aug 2018 15:46:34 +0100
From: James Baker <james.d.baker@gmail.com>
Subject: Re: [Moses-support] Issues running MGiza on AWS machine
To: tahoar@slate.rocks
Cc: moses-support@mit.edu
Message-ID:
<CAOa=L2wmzCDynB=FnqV01bP+zK7HqaviVtyOni1n8GbLXT_L2A@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Thanks Tom - using the Moses version of symal rather than the MGiza version
fixed it (although still not sure why it should be different to the one I
built on my desktop). I hadn't realised they were different, as the
instructions on the Moses website state you should copy all binaries from
MGiza into the Moses directory:
http://www.statmt.org/moses/?n=Moses.ExternalTools#ntoc3

Thanks for your help.

James

On Wed, 1 Aug 2018 at 02:14, Tom Hoar <tahoar@slate.rocks> wrote:

> Hi James,
>
> Since train-model.perl fails at step 4 fails with the MGIZA binaries you
> build on your AWS machine, but succeeds when you copy MGIZA binaries that
> you built on your local Ubuntu 16.04 machine, do the build logs show a
> missing dependency?
>
> My next question, why don't you just use the binaries that work? It seems
> like the AWS machine's Ubuntu distro is missing dependencies and the
> MGIZA++ build failed. Check those build logs.
>
> If you want to troubleshoot deeper, you need to backtrack from step 4. The
> train-model.perl step 4 uses the output from step 3, i.e. the word
> alignment file. Check if that word alignment file is corrupted.
>
> Then check step 3, its inputs are the GIZA alignment files output in step
> 2. This step uses the symal binary executable. Make sure you're using the
> Moses version of symal, not the one in the MGIZA library.
> http://article.gmane.org/gmane.comp.nlp.moses.user/11544
> http://moses-support.mit.narkive.com/KpKC2TQn/which-symal
>
> Backtracking to step 2, log lines with the following text messages should
> cause the mgiza executable to terminate but it doesn't. The parallel forks
> in train-model.perl mask the failure, processing continues and you
> experience ambiguous failures downstream.
>
> ERROR: A SOURCE or TARGET sentence has a zero-length sentence.
> ERROR! DUPLICATED ENTRY
> WARNING: The following sentence pair has source/target sentence length
> ration more than
>
> There are rarely errors in Step 1, but if you are experiencing a compile
> error on AWS, those MGIZA binaries in step 1 could be the cause.
>
> Also, the C++ binary executables are not the only things that change when
> you use use the alternate build. If you also copied the merge_alignment.py,
> this could be a problem in train-model.perl step 2. Make sure the AWS build
> has this in the right place and that it runs on the AWS distro's Python
> interpreter.
>
> Tom
>
>
>
> On 7/31/2018 11:01 PM, moses-support-request@mit.edu wrote:
>
> Date: Tue, 31 Jul 2018 11:41:01 +0100
> From: James Baker <james.d.baker@gmail.com> <james.d.baker@gmail.com>
> Subject: [Moses-support] Issues running MGiza on AWS machine
> To: moses-support@mit.edu
> Message-ID:
> <CAOa=L2woDwDETeeq7RAs0LxmnbAA_Q7qH=kF8aPAH7ivBk0qgA@mail.gmail.com> <CAOa=L2woDwDETeeq7RAs0LxmnbAA_Q7qH=kF8aPAH7ivBk0qgA@mail.gmail.com>
> Content-Type: text/plain; charset="utf-8"
>
> Hi,
>
> I'm having some peculiar issues with MGiza++. Using MGiza and Moses, I've
> successfully built some translation models on my Ubuntu 16.04 desktop
> machine. I'd now like to do the same thing, but on a machine hosted in AWS.
>
> I'm using the same operating system, and as far as I can tell all my
> versions are identical. The build of MGiza++ runs fine, reports no errors,
> and produces output the same as on my desktop machine. However, when I try
> to build the models, I get a whole load of errors and the resultant models
> are empty (64 bytes for the reordering model, 0 bytes for the translation
> model - the language model builds fine).
>
> The first "errors" I can see in the log seem to occur on stage 4 of the
> Moses training script (train-model.perl):
>
> (4) generate lexical translation table 0-0 @ Tue Jul 31 10:22:58 UTC 2018
> (/opt/model-builder/training/data.ru
> ,/opt/model-builder/training/data.en,/opt/model-builder/training/model/lex)
> !Argument "anna" isn't numeric in numeric ge (>=) at
> /opt/model-builder/mosesdecoder/scripts/training/LexicalTranslationModel.pm
> line 112, <A> line 1.
> Use of uninitialized value $ei in numeric ge (>=) at
> /opt/model-builder/mosesdecoder/scripts/training/LexicalTranslationModel.pm
> line 112, <A> line 1.
> Use of uninitialized value $ei in hash element at
> /opt/model-builder/mosesdecoder/scripts/training/LexicalTranslationModel.pm
> line 118, <A> line 1.
> Use of uninitialized value $ei in array element at
> /opt/model-builder/mosesdecoder/scripts/training/LexicalTranslationModel.pm
> line 121, <A> line 1.
> Use of uninitialized value $ei in array element at
> /opt/model-builder/mosesdecoder/scripts/training/LexicalTranslationModel.pm
> line 123, <A> line 1.
> ...
>
> There are a large number of errors of that nature, and following those
> errors there are additional errors but I suspect these are caused by the
> fact that this stage is failing.
>
> It's possible that there are earlier problems, but I'm not really sure what
> to be looking for in the logs (for instance - there are some lines warning
> about alignments in Model2 being 0 - is that an issue?).
>
> If I replace the MGiza binaries built on the AWS machine with the binaries
> built on my desktop, it runs fine - so I know it's an issue with MGiza and
> presumably something to do with my build. The commands I'm running to build
> and install are as follows
>
> git clone https://github.com/moses-smt/mgiza.git
> cd mgiza/mgizapp
> cmake .
> make
> make install
> cp bin/* ../../mosesdecoder/bin
> cp scripts/merge_alignment.py ../../mosesdecoder/bin
>
> As I mentioned previously, these commands work fine on my desktop machine
> which should be a very similar (if not identical) set up.
>
> Does anyone have any ideas as to what might be causing the problem (or,
> more importantly, what I can do to fix it)?
>
> Thanks in advance,
> James
>
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20180801/0a79751e/attachment-0001.html

------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


End of Moses-support Digest, Vol 142, Issue 3
*********************************************

0 Response to "Moses-support Digest, Vol 142, Issue 3"

Post a Comment