Moses-support Digest, Vol 127, Issue 15

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."


Today's Topics:

1. TRAINING_extract-phrases ERROR: malformed XML (Ergun Bicici)
2. Re: TRAINING_extract-phrases ERROR: malformed XML (Ergun Bicici)
3. Parallel Subsampling (Sanjanashree Palanivel)
4. EMS transliteration crashes (Rofael Behnam)


----------------------------------------------------------------------

Message: 1
Date: Thu, 11 May 2017 10:32:01 +0300
From: Ergun Bicici <bicici@gmail.com>
Subject: [Moses-support] TRAINING_extract-phrases ERROR: malformed XML
To: moses-support <moses-support@mit.edu>
Message-ID:
<CAB59qTPAz98QY6VYjEr8Ca2hwb-Rcniqk8VF64HChof=71cbpw@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

TRAINING_extract-phrases is giving
ERROR: malformed XML: Wirtschaftsjahr Betriebsgr?sse < 50.000 kg 120.000 kg
ERROR: malformed XML: < ! -- / * Font Definitions *

etc.

this appears to be due to the tokenization of html tags.

Is there an option of Moses to handle these?

--

Regards,
Ergun

Ergun Bi?ici
http://bicici.github.com/ <http://ergunbicici.blogspot.com/>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20170511/5920ca8e/attachment-0001.html

------------------------------

Message: 2
Date: Thu, 11 May 2017 10:33:37 +0300
From: Ergun Bicici <bicici@gmail.com>
Subject: Re: [Moses-support] TRAINING_extract-phrases ERROR: malformed
XML
To: moses-support <moses-support@mit.edu>
Message-ID:
<CAB59qTNiUsGqoagto0drZPef7SfdCUDpV+wd_JqsYokOz5agtw@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Similarly:
ERROR: some opened tags were never closed: it shares some features in
common with the SGML < ! [ CDATA [ ] ] > construct , in that it declares a
block of text which is not for parsing .


On Thu, May 11, 2017 at 10:32 AM, Ergun Bicici <bicici@gmail.com> wrote:

>
> TRAINING_extract-phrases is giving
> ERROR: malformed XML: Wirtschaftsjahr Betriebsgr?sse < 50.000 kg 120.000
> kg
> ERROR: malformed XML: < ! -- / * Font Definitions *
>
> etc.
>
> this appears to be due to the tokenization of html tags.
>
> Is there an option of Moses to handle these?
>
> --
>
> Regards,
> Ergun
>
> Ergun Bi?ici
> http://bicici.github.com/ <http://ergunbicici.blogspot.com/>
>



--

Regards,
Ergun

Ergun Bi?ici
http://bicici.github.com/ <http://ergunbicici.blogspot.com/>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20170511/5fb74b72/attachment-0001.html

------------------------------

Message: 3
Date: Thu, 11 May 2017 17:13:31 +0530
From: Sanjanashree Palanivel <sanjanashree@gmail.com>
Subject: [Moses-support] Parallel Subsampling
To: moses-support@mit.edu
Message-ID:
<CAAc_kp5b4gZL9ErvkPBqwc=1M5uvxWr7LGKOpN8Dtc2R-Lh7Hg@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Dear All,

What is parallel subsampling in bilingual corpus. IT would be great if
I am getting an earnest reply

--
Thanks and regards,

Sanjanasri J.P
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20170511/6dacf064/attachment-0001.html

------------------------------

Message: 4
Date: Thu, 11 May 2017 14:58:58 +0300
From: Rofael Behnam <rofaelemil@gmail.com>
Subject: [Moses-support] EMS transliteration crashes
To: moses-support@mit.edu
Message-ID:
<CANEmP-V-TKj7wL7WnB1NDeZNVb0M=JoyBxW90K6ze4FNpDJc8A@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Hello,
Hope all is good.
I'm trying to use Moses with Farasa for Arabic-English Transliteration, but
Moses crashes during building transliteration model.
kindly find attached archive for step 23 with error, stdout and details.
please, help me to run transliteration using Moses with Arabic and English.
I appreciate your understanding.
Thanks in advance.

Best Regards,
*Rofael Emil Fayez Behnam*
Bachelor of Science in Computer and Systems Engineering,
Faculty of Engineering,
Alexandria University,
Egypt.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20170511/da3e4c19/attachment.html
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 23.zip
Type: application/zip
Size: 61789 bytes
Desc: not available
Url : http://mailman.mit.edu/mailman/private/moses-support/attachments/20170511/da3e4c19/attachment.zip

------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


End of Moses-support Digest, Vol 127, Issue 15
**********************************************

0 Response to "Moses-support Digest, Vol 127, Issue 15"

Post a Comment