Send Moses-support mailing list submissions to
moses-support@mit.edu
To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu
You can reach the person managing the list at
moses-support-owner@mit.edu
When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."
Today's Topics:
1. Re: Moses-support post from arnaud.gicquel@linguacustodia.com
requires approval (Hieu Hoang)
----------------------------------------------------------------------
Message: 1
Date: Tue, 18 Mar 2014 15:21:29 +0000
From: Hieu Hoang <Hieu.Hoang@ed.ac.uk>
Subject: Re: [Moses-support] Moses-support post from
arnaud.gicquel@linguacustodia.com requires approval
To: arnaud.gicquel@linguacustodia.com, moses-support
<moses-support@mit.edu>
Message-ID:
<CAEKMkbiYDuE1xyPypHME_H_GAiXcma30CNyo4F8YQAte6aeFzA@mail.gmail.com>
Content-Type: text/plain; charset="iso-8859-1"
please subscribe to the mailing list before posting to it. You can
subscribe here:
http://mailman.mit.edu/mailman/listinfo/moses-support
I don't really understand your questions. All characters are taken into
account by the decoder and the training algorithms.
There are some reserved characters that you must not use - [ ] | < > &
You might want to put something into a xml tag, eg. <private-xml
ff="....">. I think then the decoder will ignore it
On 18 March 2014 14:20, <moses-support-owner@mit.edu> wrote:
> As list administrator, your authorization is requested for the
> following mailing list posting:
>
> List: Moses-support@mit.edu
> From: arnaud.gicquel@linguacustodia.com
> Subject: Issue about sentence segmentation
> Reason: Post by non-member to a members-only list
>
> At your convenience, visit:
>
> http://mailman.mit.edu/mailman/admindb/moses-support
>
> to approve or deny the request.
>
>
> ---------- Forwarded message ----------
> From: Arnaud Gicquel <arnaud.gicquel@linguacustodia.com>
> To: moses-support@MIT.EDU
> Cc:
> Date: Tue, 18 Mar 2014 15:20:38 +0100
> Subject: Issue about sentence segmentation
>
> Hi all
>
> I am trying to develop a specific segmenter. The goal is to send Moses
> decoder sentences instead of large textual flows syntactically incoherent.
> In order to integrate this segmenter in an automatic workflow of documents
> translation. I would define as a sentence delimiter any character that the
> decoder does not take into account in its statistics.
>
> Is there a completely neutral character (I don't want it to be considered
> unknown) that I could use as a delimiter?
>
> Thank you for your help
>
> Arnaud Gicquel
>
> --
> Lingua Custodia
> 1, Place Charles de Gaulle
> 78180 Montigny le Bretonneux - France
> Tel : 33 1 30 44 04 23
> Email : arnaud.gicquel@linguacustodia.com
> Website : www.linguacustodia.com
>
>
> ---------- Forwarded message ----------
> From: moses-support-request@mit.edu
> To:
> Cc:
> Date:
> Subject: confirm d2337e56c58e533c4286cc73dbb52d9352c94e86
> If you reply to this message, keeping the Subject: header intact,
> Mailman will discard the held message. Do this if the message is
> spam. If you reply to this message and include an Approved: header
> with the list password in it, the message will be approved for posting
> to the list. The Approved: header can also appear in the first line
> of the body of the reply.
>
--
Hieu Hoang
Research Associate
University of Edinburgh
http://www.hoang.co.uk/hieu
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20140318/967b16c1/attachment-0001.htm
------------------------------
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support
End of Moses-support Digest, Vol 89, Issue 40
*********************************************
Subscribe to:
Post Comments (Atom)
0 Response to "Moses-support Digest, Vol 89, Issue 40"
Post a Comment