Moses-support Digest, Vol 89, Issue 74

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."


Today's Topics:

1. Problems by training Moses (Ricardo Cabello S?nchez)
2. Re: Is it possible to use -xml-input flag with the recaser ?
(kamel nebhi)
3. Re: Problems by training Moses (Philipp Koehn)


----------------------------------------------------------------------

Message: 1
Date: Fri, 28 Mar 2014 17:57:45 +0100
From: Ricardo Cabello S?nchez
<ricardo.cabello.sanchez@googlemail.com>
Subject: [Moses-support] Problems by training Moses
To: moses-support@mit.edu
Message-ID:
<CAJxWzkZUfiEe4M7PgzgxCgLVptqfTFqNo1Np5CKc3wS7GpNCSQ@mail.gmail.com>
Content-Type: text/plain; charset="iso-8859-1"

Dear all,


I'm new in this mailing list. Sorry if I am asking something very obvious
and easy but I'm pretty new in this field.

I'm starting working with Moses for my PhD in computational linguistics and
now, once I think installation went ok, I am trying to train Moses with
provided default corpora. Process crashes in the early preparation data
step. I have been solving other errors but now I think I would need help
with this. I get these errors (the whole process pasted below) that I can
solve.

Could you please help?

<<<
ricardo@ricardo-Satellite-L40:~/mosesdecoder/primera_vez$ ./experiment.perl
-config config_primera_vez.toy -exec
STARTING UP AS PROCESS 4267 ON ricardo-Satellite-L40 AT dom mar 23 12:34:43
CET 2014
LOAD CONFIG...
working directory is /home/ricardo/mosesdecoder/primera_vez
running experimenal run number 17

ESTABLISH WHICH STEPS NEED TO BE RUN

FIND DEPENDENCIES BETWEEN STEPS

CHECKING IF OLD STEPS ARE RE-USABLE

STEP SUMMARY:
59 CORPUS:toy:tokenize -> re-using (1)
58 CORPUS:toy:clean -> re-using (1)
54 CORPUS:toy:truecase -> re-using (1)
49 TRUECASER:consolidate -> re-using (1)
48 TRUECASER:train -> re-using (1)
47 LM:toy:tokenize -> re-using (1)
45 LM:toy:truecase -> re-using (1)
43 LM:toy:train -> re-using (1)
40 LM:toy:binarize -> re-using (1)
39 TRAINING:consolidate -> re-using (1)
38 TRAINING:prepare-data -> run
37 TRAINING:run-giza -> run
36 TRAINING:run-giza-inverse -> run
35 TRAINING:symmetrize-giza -> run
34 TRAINING:build-lex-trans -> run
31 TRAINING:extract-phrases -> run
30 TRAINING:build-reordering -> run
29 TRAINING:build-ttable -> run
26 TRAINING:create-config -> run
24 TUNING:apply-weights -> run
23 EVALUATION:test:input-from-sgm -> re-using (1)
22 EVALUATION:test:tokenize-input -> re-using (1)
17 EVALUATION:test:truecase-input -> re-using (1)
15 EVALUATION:test:filter -> run
14 EVALUATION:test:apply-filter -> run
13 EVALUATION:test:decode -> run
12 EVALUATION:test:remove-markup -> run
10 EVALUATION:test:detruecase-output -> run
9 EVALUATION:test:detokenize-output -> run
8 EVALUATION:test:wrap -> run
7 EVALUATION:test:reference-from-sgm -> re-using (1)
6 EVALUATION:test:tokenize-reference -> re-using (1)
4 EVALUATION:test:nist-bleu -> run
3 EVALUATION:test:nist-bleu-c -> run
2 EVALUATION:test:analysis -> run
1 EVALUATION:test:analysis-coverage -> run
0 REPORTING:report -> run
convert: iCCP: profile 'default_rgb.icc': 0h: PCS illuminant is not D50
`/tmp/magick-4283knebIQusILAg1' @
warning/png.c/MagickPNGWarningHandler/1830.
Warning: Cannot convert string
"-*-Helvetica-Medium-R-Normal--*-140-*-*-P-*-ISO8859-1" to type FontStruct
Warning: Cannot convert string
"-*-Helvetica-Medium-R-Normal--*-120-*-*-P-*-ISO8859-1" to type FontStruct
Warning: Cannot convert string
"-*-Helvetica-Medium-R-Normal--*-100-*-*-P-*-ISO8859-1" to type FontStruct
Warning: Cannot convert string
"-*-Helvetica-Bold-R-Normal--*-120-*-*-P-*-ISO8859-1" to type FontStruct
convert: iCCP: profile 'default_rgb.icc': 0h: PCS illuminant is not D50
`/tmp/magick-4292U5L_nYYBb8pG1' @
warning/png.c/MagickPNGWarningHandler/1830.

EXECUTE STEPS
number of steps doable or running: 1 at dom mar 23 12:34:45 CET 2014
doable: TRAINING:prepare-data
executing
/home/ricardo/mosesdecoder/primera_vez/steps/17/TRAINING_prepare-data.17
via sh (1 active)
convert: iCCP: profile 'default_rgb.icc': 0h: PCS illuminant is not D50
`/tmp/magick-4307QgvlrdxUKXXc1' @
warning/png.c/MagickPNGWarningHandler/1830.
step TRAINING:prepare-data crashed
number of steps doable or running: 0 at dom mar 23 12:34:54 CET 2014
convert: iCCP: profile 'default_rgb.icc': 0h: PCS illuminant is not D50
`/tmp/magick-4333yx8taQhVp75g1' @
warning/png.c/MagickPNGWarningHandler/1830.

>>>

Thank you and best regards,

Ricardo
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20140328/10ff278e/attachment-0001.htm

------------------------------

Message: 2
Date: Fri, 28 Mar 2014 17:37:26 +0000
From: kamel nebhi <k.nebhi@sheffield.ac.uk>
Subject: Re: [Moses-support] Is it possible to use -xml-input flag
with the recaser ?
To: Hieu Hoang <Hieu.Hoang@ed.ac.uk>
Cc: moses-support <moses-support@mit.edu>
Message-ID:
<CAG66Y3d5SobpLYhZMZ_Xd3D0jPtRecV2=F0vr0xW-Gb+7HeC_g@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Thanks for your help. I try it adding the [xml-input] to the moses.ini file.
However, recaser no longer works when i put this flag.
I have the output file with original capitalization.








2014-03-27 11:46 GMT+00:00 Hieu Hoang <Hieu.Hoang@ed.ac.uk>:

> try adding
> [xml-input]
> whatever
>
> to the recaser's moses.ini file. The recaser is just a monotone
> phrase-based model
>
>
> On 23 March 2014 20:53, kamel nebhi <k.nebhi@sheffield.ac.uk> wrote:
>
>> Hello,
>>
>> I'm actually use the recaser provided by Moses. I want to ignore some
>> part of the corpus and i'm looking for a solution such as the -xml-input
>> flag (see at http://www.statmt.org/moses/?n=Moses.AdvancedFeatures#ntoc9
>> ).
>>
>> Is it possible to use this feature with the recaser ? If no, how can i
>> ignore some part of the corpus ?
>>
>> Best
>>
>> _______________________________________________
>> Moses-support mailing list
>> Moses-support@mit.edu
>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>
>>
>
>
> --
> Hieu Hoang
> Research Associate
> University of Edinburgh
> http://www.hoang.co.uk/hieu
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20140328/b6f3b7de/attachment-0001.htm

------------------------------

Message: 3
Date: Fri, 28 Mar 2014 17:49:07 +0000
From: Philipp Koehn <pkoehn@inf.ed.ac.uk>
Subject: Re: [Moses-support] Problems by training Moses
To: Ricardo Cabello S?nchez <ricardo.cabello.sanchez@googlemail.com>
Cc: "moses-support@mit.edu" <moses-support@mit.edu>
Message-ID:
<CAAFADDCsc1Dk56wAFS2c1LLfijK1RfKFKJV0v1BzQD6MArRnqw@mail.gmail.com>
Content-Type: text/plain; charset=ISO-8859-1

Hi,

can you check error messages in

steps/17/TRAINING_prepare-data.17.STDERR

and the contents of the directory

model/prepared.17

Most likely, you have not properly installed mkcls, but the error reporting
in the file above should tell you.

-phi



On Fri, Mar 28, 2014 at 4:57 PM, Ricardo Cabello S?nchez
<ricardo.cabello.sanchez@googlemail.com> wrote:
> Dear all,
>
>
> I'm new in this mailing list. Sorry if I am asking something very obvious
> and easy but I'm pretty new in this field.
>
> I'm starting working with Moses for my PhD in computational linguistics and
> now, once I think installation went ok, I am trying to train Moses with
> provided default corpora. Process crashes in the early preparation data
> step. I have been solving other errors but now I think I would need help
> with this. I get these errors (the whole process pasted below) that I can
> solve.
>
> Could you please help?
>
> <<<
> ricardo@ricardo-Satellite-L40:~/mosesdecoder/primera_vez$ ./experiment.perl
> -config config_primera_vez.toy -exec
> STARTING UP AS PROCESS 4267 ON ricardo-Satellite-L40 AT dom mar 23 12:34:43
> CET 2014
> LOAD CONFIG...
> working directory is /home/ricardo/mosesdecoder/primera_vez
> running experimenal run number 17
>
> ESTABLISH WHICH STEPS NEED TO BE RUN
>
> FIND DEPENDENCIES BETWEEN STEPS
>
> CHECKING IF OLD STEPS ARE RE-USABLE
>
> STEP SUMMARY:
> 59 CORPUS:toy:tokenize -> re-using (1)
> 58 CORPUS:toy:clean -> re-using (1)
> 54 CORPUS:toy:truecase -> re-using (1)
> 49 TRUECASER:consolidate -> re-using (1)
> 48 TRUECASER:train -> re-using (1)
> 47 LM:toy:tokenize -> re-using (1)
> 45 LM:toy:truecase -> re-using (1)
> 43 LM:toy:train -> re-using (1)
> 40 LM:toy:binarize -> re-using (1)
> 39 TRAINING:consolidate -> re-using (1)
> 38 TRAINING:prepare-data -> run
> 37 TRAINING:run-giza -> run
> 36 TRAINING:run-giza-inverse -> run
> 35 TRAINING:symmetrize-giza -> run
> 34 TRAINING:build-lex-trans -> run
> 31 TRAINING:extract-phrases -> run
> 30 TRAINING:build-reordering -> run
> 29 TRAINING:build-ttable -> run
> 26 TRAINING:create-config -> run
> 24 TUNING:apply-weights -> run
> 23 EVALUATION:test:input-from-sgm -> re-using (1)
> 22 EVALUATION:test:tokenize-input -> re-using (1)
> 17 EVALUATION:test:truecase-input -> re-using (1)
> 15 EVALUATION:test:filter -> run
> 14 EVALUATION:test:apply-filter -> run
> 13 EVALUATION:test:decode -> run
> 12 EVALUATION:test:remove-markup -> run
> 10 EVALUATION:test:detruecase-output -> run
> 9 EVALUATION:test:detokenize-output -> run
> 8 EVALUATION:test:wrap -> run
> 7 EVALUATION:test:reference-from-sgm -> re-using (1)
> 6 EVALUATION:test:tokenize-reference -> re-using (1)
> 4 EVALUATION:test:nist-bleu -> run
> 3 EVALUATION:test:nist-bleu-c -> run
> 2 EVALUATION:test:analysis -> run
> 1 EVALUATION:test:analysis-coverage -> run
> 0 REPORTING:report -> run
> convert: iCCP: profile 'default_rgb.icc': 0h: PCS illuminant is not D50
> `/tmp/magick-4283knebIQusILAg1' @
> warning/png.c/MagickPNGWarningHandler/1830.
> Warning: Cannot convert string
> "-*-Helvetica-Medium-R-Normal--*-140-*-*-P-*-ISO8859-1" to type FontStruct
> Warning: Cannot convert string
> "-*-Helvetica-Medium-R-Normal--*-120-*-*-P-*-ISO8859-1" to type FontStruct
> Warning: Cannot convert string
> "-*-Helvetica-Medium-R-Normal--*-100-*-*-P-*-ISO8859-1" to type FontStruct
> Warning: Cannot convert string
> "-*-Helvetica-Bold-R-Normal--*-120-*-*-P-*-ISO8859-1" to type FontStruct
> convert: iCCP: profile 'default_rgb.icc': 0h: PCS illuminant is not D50
> `/tmp/magick-4292U5L_nYYBb8pG1' @
> warning/png.c/MagickPNGWarningHandler/1830.
>
> EXECUTE STEPS
> number of steps doable or running: 1 at dom mar 23 12:34:45 CET 2014
> doable: TRAINING:prepare-data
> executing
> /home/ricardo/mosesdecoder/primera_vez/steps/17/TRAINING_prepare-data.17 via
> sh (1 active)
> convert: iCCP: profile 'default_rgb.icc': 0h: PCS illuminant is not D50
> `/tmp/magick-4307QgvlrdxUKXXc1' @
> warning/png.c/MagickPNGWarningHandler/1830.
> step TRAINING:prepare-data crashed
> number of steps doable or running: 0 at dom mar 23 12:34:54 CET 2014
> convert: iCCP: profile 'default_rgb.icc': 0h: PCS illuminant is not D50
> `/tmp/magick-4333yx8taQhVp75g1' @
> warning/png.c/MagickPNGWarningHandler/1830.
>
>>>>
>
> Thank you and best regards,
>
> Ricardo
>
>
>
>
>
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>



------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


End of Moses-support Digest, Vol 89, Issue 74
*********************************************

0 Response to "Moses-support Digest, Vol 89, Issue 74"

Post a Comment