Moses-support Digest, Vol 129, Issue 12

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."


Today's Topics:

1. Fwd: Moses-support post from ilknurdurgar@sabanciuniv.edu
requires approval (Hieu Hoang)


----------------------------------------------------------------------

Message: 1
Date: Tue, 11 Jul 2017 14:24:54 +0100
From: Hieu Hoang <hieuhoang@gmail.com>
Subject: [Moses-support] Fwd: Moses-support post from
ilknurdurgar@sabanciuniv.edu requires approval
To: ilknurdurgar@sabanciuniv.edu, moses-support
<moses-support@mit.edu>
Message-ID:
<CAEKMkbh6vEnt_dn=5y1H5n6_1pFZ7yigdDMqm+07uBETP0CS3A@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Please subscribe to the Moses mailing list before posting to it. You can
subscribe here:
http://mailman.mit.edu/mailman/listinfo/moses-support
To answer your question - you must escape special characters such as <>[]|
before giving it to the decoder. You can do this using the moses script
scripts/tokenizer/escape-special-chars.perl
After decoding, you can put those characters back in by using
scripts/tokenizer/deescape-special-chars.perl
You must escape the training, tuning and test data the same way

Hieu Hoang
http://moses-smt.org/


---------- Forwarded message ----------
From: <moses-support-owner@mit.edu>
Date: 10 July 2017 at 12:14
Subject: Moses-support post from ilknurdurgar@sabanciuniv.edu requires
approval
To: moses-support-owner@mit.edu


As list administrator, your authorization is requested for the
following mailing list posting:

List: Moses-support@mit.edu
From: ilknurdurgar@sabanciuniv.edu
Subject: "ERROR: malformed XML" during evaluation
Reason: Post by non-member to a members-only list

At your convenience, visit:

http://mailman.mit.edu/mailman/admindb/moses-support

to approve or deny the request.


---------- Forwarded message ----------
From: "Ilknur Durgar El-Kahlout (Alumni)" <ilknurdurgar@sabanciuniv.edu>
To: moses-support@mit.edu
Cc:
Bcc:
Date: Mon, 10 Jul 2017 13:14:30 +0200
Subject: "ERROR: malformed XML" during evaluation

Hi;

I want to build an Arabic-English SMT system. I preprocess the data, use an
Arabic tokenizer that converts Arabic to Buckwalter representation and then
train the system. I use Moses to decode and everything is OK with this
configuration. I successfully got the translations.

But when I want to use Moses-chart, it crashes with the above error,
tuning and test sets have sentences with "<" generated during the Bucwalter
conversion. I can not get rid of these characters as they are the part of
a word.

How can I force Moses-chart to ignore these chars?

Thanks in advance.


--ilknur


---------- Forwarded message ----------
From: moses-support-request@mit.edu
To:
Cc:
Bcc:
Date:
Subject: confirm 4be4359f1bab37462702f8cb6f94001d54175354
If you reply to this message, keeping the Subject: header intact,
Mailman will discard the held message. Do this if the message is
spam. If you reply to this message and include an Approved: header
with the list password in it, the message will be approved for posting
to the list. The Approved: header can also appear in the first line
of the body of the reply.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20170711/88c0d2fb/attachment-0001.html

------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


End of Moses-support Digest, Vol 129, Issue 12
**********************************************

0 Response to "Moses-support Digest, Vol 129, Issue 12"

Post a Comment