Moses-support Digest, Vol 84, Issue 16

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."


Today's Topics:

1. Re: Placeholders (Per Tunedal)
2. Re: Placeholders (Hieu Hoang)
3. Re: Hiero ems should die on decode error, but doesn't (???)


----------------------------------------------------------------------

Message: 1
Date: Fri, 11 Oct 2013 09:08:43 +0200
From: Per Tunedal <per.tunedal@operamail.com>
Subject: Re: [Moses-support] Placeholders
To: moses-support@mit.edu
Message-ID:
<1381475323.19167.32709265.2D6F6277@webmail.messagingengine.com>
Content-Type: text/plain

Hi,
place holders would be useful. What's the implication of "it just needs
the XML parsing to run"?
Does this mean that the option could only be used with html-input? Not
with plain text?
Yours,
Per Tunedal

On Thu, Oct 10, 2013, at 15:30, Hieu Hoang wrote:
> On 10 October 2013 13:33, Nicola Bertoldi <bertoldi@fbk.eu> wrote:
>
> > Hi Hieu
> >
> > I read the documentation
> > and you mention that you enable the exclusive mode of xml-input
> >
> > I see few issues:
> >
> > - you mention that you enable the exclusive mode of xml-input;
> > this can conflict with other usage of xml-input which instead require
> > the inclusive mode.
> > do you have any comments on that?
> >
>
> it can be exclusive, inclusive or anything else except pass-through. It
> just requires the XML handling to run
>
>
> >
> > - when you use the exclusive mode you force the translation of the span
> > (@num@) with "100")
> > and other larger span including @num@ are not allowed
> > am I right?
> > If yes, what is the advantage of having phrase pairs including other
> > words
> >
>
> it doesn't create XML options, it just needs the XML parsing to run.
>
>
> >
> > - what is the meaning of "-placeholder-factor 1" ?
> >
> It stores the original text in the source factor 1. The placeholder
> symbol
> is in the factor 0, or whatever the translation model was configured to
> use.
>
>
> >
> >
> > Nicola Bertoldi
> >
> >
> >
> >
> > On Oct 10, 2013, at 1:05 PM, Hieu Hoang wrote:
> >
> > Hi all
> >
> > Achim and I have been working on adding support for placeholders into
> > Moses. That is, replacing a number, date, or named entity with a symbol eg.
> > @num@, -date-, =named-entity=. We think it would be especially useful for
> > commercial users of Moses, and for people translating text with lots of
> > numbers, dates etc.
> >
> > It is now supported in the Moses training and decoding pipeline. See the
> > following URL for more details.
> > http://www.statmt.org/moses/?n=Moses.AdvancedFeatures#ntoc60
> >
> > --
> > Hieu Hoang
> > Research Associate
> > University of Edinburgh
> > http://www.hoang.co.uk/hieu
> >
> > _______________________________________________
> > Moses-support mailing list
> > Moses-support@mit.edu<mailto:Moses-support@mit.edu>
> > http://mailman.mit.edu/mailman/listinfo/moses-support
> >
> >
> >
>
>
> --
> Hieu Hoang
> Research Associate
> University of Edinburgh
> http://www.hoang.co.uk/hieu
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support


------------------------------

Message: 2
Date: Fri, 11 Oct 2013 11:24:45 +0000
From: Hieu Hoang <Hieu.Hoang@ed.ac.uk>
Subject: Re: [Moses-support] Placeholders
To: Per Tunedal <per.tunedal@operamail.com>
Cc: moses-support <moses-support@mit.edu>
Message-ID:
<CAEKMkbjebUQjQ17r1YHwGcZrn=vhGWz5fm-OGgL2f86taap4Nw@mail.gmail.com>
Content-Type: text/plain; charset="iso-8859-1"

it doesn't process HTML. Just plain text, and XML markups to indicate the
placeholder

For example, your original sentence is
you owe me $ 100 .
After running it through the placeholder script, the sentence become
you owe me $ <ne translation="@num@" entity="100">@num@</ne> .

The XML processing is needed to parse this XML



On 11 October 2013 07:08, Per Tunedal <per.tunedal@operamail.com> wrote:

> Hi,
> place holders would be useful. What's the implication of "it just needs
> the XML parsing to run"?
> Does this mean that the option could only be used with html-input? Not
> with plain text?
> Yours,
> Per Tunedal
>
> On Thu, Oct 10, 2013, at 15:30, Hieu Hoang wrote:
> > On 10 October 2013 13:33, Nicola Bertoldi <bertoldi@fbk.eu> wrote:
> >
> > > Hi Hieu
> > >
> > > I read the documentation
> > > and you mention that you enable the exclusive mode of xml-input
> > >
> > > I see few issues:
> > >
> > > - you mention that you enable the exclusive mode of xml-input;
> > > this can conflict with other usage of xml-input which instead require
> > > the inclusive mode.
> > > do you have any comments on that?
> > >
> >
> > it can be exclusive, inclusive or anything else except pass-through. It
> > just requires the XML handling to run
> >
> >
> > >
> > > - when you use the exclusive mode you force the translation of the span
> > > (@num@) with "100")
> > > and other larger span including @num@ are not allowed
> > > am I right?
> > > If yes, what is the advantage of having phrase pairs including other
> > > words
> > >
> >
> > it doesn't create XML options, it just needs the XML parsing to run.
> >
> >
> > >
> > > - what is the meaning of "-placeholder-factor 1" ?
> > >
> > It stores the original text in the source factor 1. The placeholder
> > symbol
> > is in the factor 0, or whatever the translation model was configured to
> > use.
> >
> >
> > >
> > >
> > > Nicola Bertoldi
> > >
> > >
> > >
> > >
> > > On Oct 10, 2013, at 1:05 PM, Hieu Hoang wrote:
> > >
> > > Hi all
> > >
> > > Achim and I have been working on adding support for placeholders into
> > > Moses. That is, replacing a number, date, or named entity with a
> symbol eg.
> > > @num@, -date-, =named-entity=. We think it would be especially useful
> for
> > > commercial users of Moses, and for people translating text with lots of
> > > numbers, dates etc.
> > >
> > > It is now supported in the Moses training and decoding pipeline. See
> the
> > > following URL for more details.
> > > http://www.statmt.org/moses/?n=Moses.AdvancedFeatures#ntoc60
> > >
> > > --
> > > Hieu Hoang
> > > Research Associate
> > > University of Edinburgh
> > > http://www.hoang.co.uk/hieu
> > >
> > > _______________________________________________
> > > Moses-support mailing list
> > > Moses-support@mit.edu<mailto:Moses-support@mit.edu>
> > > http://mailman.mit.edu/mailman/listinfo/moses-support
> > >
> > >
> > >
> >
> >
> > --
> > Hieu Hoang
> > Research Associate
> > University of Edinburgh
> > http://www.hoang.co.uk/hieu
> > _______________________________________________
> > Moses-support mailing list
> > Moses-support@mit.edu
> > http://mailman.mit.edu/mailman/listinfo/moses-support
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>



--
Hieu Hoang
Research Associate
University of Edinburgh
http://www.hoang.co.uk/hieu
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20131011/791db569/attachment-0001.htm

------------------------------

Message: 3
Date: Fri, 11 Oct 2013 11:48:46 +0000 (GMT)
From: ??? <roarman.seo@samsung.com>
Subject: Re: [Moses-support] Hiero ems should die on decode error, but
doesn't
To: Philipp Koehn <pkoehn@inf.ed.ac.uk>, Lane Schwartz
<dowobeha@gmail.com>
Cc: "moses-support@mit.edu" <moses-support@mit.edu>
Message-ID: <14.82.21086.E95E7525@epcpsbgx4.samsung.com>
Content-Type: text/plain; charset="us-ascii"

An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20131011/b1f0effd/attachment.htm
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: image/gif
Size: 14036 bytes
Desc: not available
Url : http://mailman.mit.edu/mailman/private/moses-support/attachments/20131011/b1f0effd/attachment.gif

------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


End of Moses-support Digest, Vol 84, Issue 16
*********************************************

0 Response to "Moses-support Digest, Vol 84, Issue 16"

Post a Comment