Moses-support Digest, Vol 84, Issue 25

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."


Today's Topics:

1. Re: Placeholders (support@precisiontranslationtools.com)
2. XLIFF support in the M4Loc project (John Tinsley)
3. Moses chart MERT crashes when run with ems (Eleftherios Avramidis)


----------------------------------------------------------------------

Message: 1
Date: Tue, 15 Oct 2013 19:25:00 -0700
From: support@precisiontranslationtools.com
Subject: Re: [Moses-support] Placeholders
To: <moses-support@mit.edu>
Message-ID:
<3ce4be384d2c75d8f754c3009c191e80@precisiontranslationtools.com>
Content-Type: text/plain; charset="utf-8"



A change from <anytag/> will no doubt disrupt existing pipelines.
Communicating the change with the new release will be a great help.

On 2013-10-15 01:35, Hieu Hoang wrote:

> they're good ideas. I'll have a think if I get round to doing it.
>
> Would also want to minimise the work I have to do, and minimize the
> disruption to people's existing pipeline.
>
> On 15 October 2013 01:33, Tom Hoar <tahoar@precisiontranslationtools.com> wrote:
>
>> I agree that <anytag/> could cause problems, especially with the growing
>> list of reserved tag names (ne, wall, zone). I wholeheartedly support a
>> fixed tag, but I'm not sure "option" is it. What about <np/> (already in
>> the manual) or <xml-markup/> or <xml-input/> or <moses/>?
>>
>> Here's another idea. The -xml-input flag supports values "exclusive,"
>> "inclusive," "ignore" and "pass-through." What about changing the flag
>> to a boolean flag. Then, use the value as the xml tags: <exclusive/>,
>> <inclusive/> and <ignore/> so the one invocation of Moses would support
>> all modes on a per-sentence basis. Just a thought. Think this would also
>> be easier if you dropped the "pass-through" option because no need for
>> backwards compatibility.
>>
>> Another idea, although slightly different subject. Moses'
>> -monotone-at-punctuation flag would be more useful if we could
>> define/override the punctuation & symbols that we want it to use. Not
>> sure how to best accomplish this.
>>
>> Tom
>>
>> On 10/15/2013 04:07 AM, Hieu Hoang wrote:
>> > In fact, we're thinking of changing <anytag/> to something fixed, like
>> > <option/>
>> >
>> > The <anytag/> behaviour isn't good XML and will cause problems in the
>> > future
>> >
>> > Any opinions on this gratefully received
>> >
>>
>> _______________________________________________
>> Moses-support mailing list
>> Moses-support@mit.edu
>> http://mailman.mit.edu/mailman/listinfo/moses-support [1]
>
> --
> Hieu Hoang
> Research Associate
> University of Edinburgh
> http://www.hoang.co.uk/hieu [2]
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support [1]




Links:
------
[1] http://mailman.mit.edu/mailman/listinfo/moses-support
[2] http://www.hoang.co.uk/hieu

------------------------------

Message: 2
Date: Wed, 16 Oct 2013 12:20:09 +0100
From: John Tinsley <jtinsley@computing.dcu.ie>
Subject: [Moses-support] XLIFF support in the M4Loc project
To: moses-support@mit.edu
Message-ID:
<CAHfkK=5BPLKf2zGO+XV9J-8kKEyt7VZaTLNAkaMO0vCMqori0A@mail.gmail.com>
Content-Type: text/plain; charset="iso-8859-1"

Hi folks,

I'm having a little trouble with XLIFF handling using some of the M4Loc
tools, specifically 'reinsert.pm' for replacing inline markup after
translation (https://code.google.com/p/m4loc/wiki/Pod_reinsert).

It works fine for simple tags where the text between the tags *should* be
translated, e.g.

src: das ist ein <bx id="1">kleines haus</bx>
tgt: this is |0-1| a |2-2| small |3-3| house |4-4|

output: this is a <bx id="1"> small house </bx>
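
The reinsertion in the example above can be sketched as follows. This is a minimal illustration, not the actual logic of reinsert.pm: it assumes the |start-end| markers give the source-token span of each target phrase, and that the tag covers source tokens 3 to 4 ("kleines haus"). The function names are hypothetical.

```python
import re

# Matches a phrase-alignment marker such as |3-4|
TRACE = re.compile(r"\|(\d+)-(\d+)\|")

def parse_trace(traced_target):
    """Split traced decoder output into (words, src_start, src_end) phrases."""
    phrases = []
    words = []
    for token in traced_target.split():
        m = TRACE.fullmatch(token)
        if m:
            phrases.append((words, int(m.group(1)), int(m.group(2))))
            words = []
        else:
            words.append(token)
    return phrases

def reinsert(traced_target, open_tag, close_tag, span_start, span_end):
    """Wrap the target phrases whose source span lies inside the tagged span."""
    out = []
    inside = False
    for words, start, end in parse_trace(traced_target):
        covered = span_start <= start and end <= span_end
        if covered and not inside:
            out.append(open_tag)
            inside = True
        elif not covered and inside:
            out.append(close_tag)
            inside = False
        out.extend(words)
    if inside:
        out.append(close_tag)
    return " ".join(out)

traced = "this is |0-1| a |2-2| small |3-3| house |4-4|"
print(reinsert(traced, '<bx id="1">', "</bx>", 3, 4))
```

If reordering splits the covered phrases apart in the target, this sketch opens and closes the tag more than once, which a real tool would have to handle differently.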

However, there are often examples of paired tags (kind of like markup
around markup) which are not handled, e.g.

das ist ein <bpt id="1">&lt;b&gt;</bpt>kleines haus<ept id="1">&lt;/b&gt;</ept>

In this case, the <bpt> and <ept> tags are paired, and the content enclosed
by each of them (e.g. &lt;b&gt;) should be stripped out, but this doesn't
appear to be happening.

Is there another tool in the project that handles this kind of markup or is
it not supported?
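
One possible workaround is a pre- and post-processing step around the existing tools. The sketch below assumes it is acceptable to swap each <bpt>/<ept> element for an empty placeholder before translation and to restore the saved content afterwards. The placeholder form and the function names are hypothetical, and whether the M4Loc tools accept such placeholders is not established here.

```python
import re

# A <bpt> or <ept> element with an id attribute and arbitrary content
PAIRED = re.compile(r'<(bpt|ept)\s+id="([^"]+)"\s*>(.*?)</\1>', re.DOTALL)
# The empty placeholder written by strip_paired()
PLACEHOLDER = re.compile(r'<(bpt|ept) id="([^"]+)"/>')

def strip_paired(segment):
    """Replace <bpt>/<ept> elements with empty placeholders.

    Returns the stripped segment and a dict of the removed content,
    keyed by (element name, id).
    """
    saved = {}

    def repl(m):
        kind, tag_id, content = m.group(1), m.group(2), m.group(3)
        saved[(kind, tag_id)] = content
        return f'<{kind} id="{tag_id}"/>'

    return PAIRED.sub(repl, segment), saved

def restore_paired(segment, saved):
    """Put the saved content back into the placeholders."""

    def repl(m):
        kind, tag_id = m.group(1), m.group(2)
        return f'<{kind} id="{tag_id}">{saved[(kind, tag_id)]}</{kind}>'

    return PLACEHOLDER.sub(repl, segment)

src = ('das ist ein <bpt id="1">&lt;b&gt;</bpt>kleines haus'
       '<ept id="1">&lt;/b&gt;</ept>')
stripped, saved = strip_paired(src)
print(stripped)
print(restore_paired(stripped, saved) == src)
```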

Thanks
John

--
John Tinsley

------------------------------

Message: 3
Date: Wed, 16 Oct 2013 14:50:02 +0200
From: Eleftherios Avramidis <eleftherios.avramidis@dfki.de>
Subject: [Moses-support] Moses chart MERT crashes when run with ems
To: Moses-support <moses-support@mit.edu>
Message-ID: <525E8B7A.8030002@dfki.de>
Content-Type: text/plain; charset="iso-8859-1"

Hi

moses_chart crashed during tuning, run as part of a default EMS
pipeline. Exactly the same settings run perfectly for phrase-based
Moses. The error was:


Start loading text SCFG phrase table. Moses format : [0.000] seconds

max-chart-span: 20

max-chart-span: 1000

Check obj->GetMisc("NumScores") == m_numScoreComponents failed in moses/TranslationModel/RuleTable/PhraseDictionaryOnDisk.cpp:91

Aborted (core dumped)

Exit code: 134

The decoder died. CONFIG WAS -weight-overwrite 'PhrasePenalty0= 0.057143 WordPenalty0= -0.285714 TranslationModel0= 0.057143 0.057143 0.057143 0.057143 TranslationModel1= 0.285714 LM0= 0.142857'
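
The failed check compares the score count stored in the on-disk rule table with the count the decoder expects. To see how many weights the configuration supplies per feature, the -weight-overwrite string above can be parsed as in this minimal sketch. The function names are hypothetical, and the sketch assumes the number of weights per feature mirrors the score count the decoder expects. The table's own count has to come from elsewhere and is passed in as `expected`.

```python
def parse_weight_overwrite(spec):
    """Map each feature name to its list of weights.

    Tokens ending in '=' start a new feature; the numbers that follow
    belong to it.
    """
    weights = {}
    current = None
    for token in spec.split():
        if token.endswith("="):
            current = token[:-1]
            weights[current] = []
        else:
            if current is None:
                raise ValueError(f"weight {token!r} appears before any feature name")
            weights[current].append(float(token))
    return weights

def find_mismatches(counts, expected):
    """Return (feature, configured, expected) for every differing count.

    `expected` is supplied by the caller, e.g. the score count the rule
    table was built with.
    """
    return [(name, counts.get(name, 0), want)
            for name, want in expected.items()
            if counts.get(name, 0) != want]

spec = ("PhrasePenalty0= 0.057143 WordPenalty0= -0.285714 "
        "TranslationModel0= 0.057143 0.057143 0.057143 0.057143 "
        "TranslationModel1= 0.285714 LM0= 0.142857")
counts = {name: len(vals) for name, vals in parse_weight_overwrite(spec).items()}
print(counts)
```

In this configuration TranslationModel0 carries four weights and TranslationModel1 one, so those are the counts to compare against whatever each table was actually built with.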


best
Lefteris

--
MSc. Inf. Eleftherios Avramidis
DFKI GmbH, Alt-Moabit 91c, 10559 Berlin
Tel. +49-30 238 95-1806

Fax. +49-30 238 95-1810

-------------------------------------------------------------------------------------------
Deutsches Forschungszentrum fuer Kuenstliche Intelligenz GmbH
Firmensitz: Trippstadter Strasse 122, D-67663 Kaiserslautern

Geschaeftsfuehrung:
Prof. Dr. Dr. h.c. mult. Wolfgang Wahlster (Vorsitzender)
Dr. Walter Olthoff

Vorsitzender des Aufsichtsrats:
Prof. Dr. h.c. Hans A. Aukes

Amtsgericht Kaiserslautern, HRB 2313
-------------------------------------------------------------------------------------------

-------------- next part --------------
A non-text attachment was scrubbed...
Name: TUNING_tune.14.STDERR
Type: application/x-extension-stderr
Size: 3769 bytes
Desc: not available
Url : http://mailman.mit.edu/mailman/private/moses-support/attachments/20131016/4b7c7b71/attachment.bin
-------------- next part --------------
A non-text attachment was scrubbed...
Name: run1.moses.ini
Type: application/x-extension-ini
Size: 1278 bytes
Desc: not available
Url : http://mailman.mit.edu/mailman/private/moses-support/attachments/20131016/4b7c7b71/attachment-0001.bin
-------------- next part --------------
#!/bin/bash

PATH="/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/home/elav01/.bin/bin/:/home/elav01/.local/bin:/home/elav01/.bin/bin/:/home/elav01/.local/bin"
cd /share/taraxu/systems/bmmt/20131009_MAN
echo 'starting at '`date`' on '`hostname`
mkdir -p /share/taraxu/systems/bmmt/20131009_MAN/tuning

/share/taraxu/software/moses-pack-2013/mosesdecoder/scripts/training/mert-moses.pl /share/taraxu/systems/bmmt/20131009_MAN/tuning/input.split.14 /share/taraxu/systems/bmmt/20131009_MAN/tuning/reference.tc.14 /share/taraxu/software/moses-pack-2013/mosesdecoder/bin/moses_chart /share/taraxu/systems/bmmt/20131009_MAN/tuning/moses.filtered.ini.14 --nbest 100 --working-dir /share/taraxu/systems/bmmt/20131009_MAN/tuning/tmp.14 --decoder-flags "-threads 8 -v 0 " --rootdir /share/taraxu/software/moses-pack-2013/mosesdecoder/scripts -mertdir /share/taraxu/software/moses-pack-2013/mosesdecoder/bin --no-filter-phrase-table
mkdir -p /share/taraxu/systems/bmmt/20131009_MAN/tuning
cp /share/taraxu/systems/bmmt/20131009_MAN/tuning/tmp.14/moses.ini /share/taraxu/systems/bmmt/20131009_MAN/tuning/moses.ini.14

echo 'finished at '`date`
touch /share/taraxu/systems/bmmt/20131009_MAN/steps/14/TUNING_tune.14.DONE

------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


End of Moses-support Digest, Vol 84, Issue 25
*********************************************
