Send Moses-support mailing list submissions to
moses-support@mit.edu
To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu
You can reach the person managing the list at
moses-support-owner@mit.edu
When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."
Today's Topics:
1. Creating a 2-gram language model (Rajkiran Rajkumar)
2. Help about Using Giza++ for comparable Corpora
(alireza tabebordbar)
----------------------------------------------------------------------
Message: 1
Date: Thu, 19 Jun 2014 11:49:21 +0530
From: Rajkiran Rajkumar <rajkiran2507@gmail.com>
Subject: [Moses-support] Creating a 2-gram language model
To: moses-support@mit.edu
Message-ID:
<CAHMStsJNs6pFSFuS4EkfAYMw52PKxSrampuxOK9V13v9yVVUzg@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"
In the baseline translation system tutorial, the command

~/irstlm/bin/build-lm.sh \
-i news-commentary-v8.fr-en.sb.en \
-t ./tmp -p -s improved-kneser-ney -o news-commentary-v8.fr-en.lm.en

is described as creating a "3-gram language model, removing singletons, smoothing
with improved Kneser-Ney, and adding sentence boundary symbols".
How can I build a 2-gram language model instead?
And, in general, which is more efficient for a bilingual corpus of 160,000
sentences: a 2-gram or a 3-gram model?
Thanks in advance,
Rajkiran
College of Engineering Guindy, India
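
A minimal sketch of one way to approach the 2-gram question above. It assumes that IRSTLM's build-lm.sh takes the n-gram order through a -n option (with 3 as the default when the option is omitted); check the script's usage output on your installation before relying on it. The output file name and the DRY_RUN switch are hypothetical and only serve the illustration. All other flags are copied unchanged from the tutorial command.

```shell
#!/bin/sh
# Sketch: build an order-2 language model with IRSTLM.
# Assumption: build-lm.sh accepts "-n ORDER" (default 3 when omitted).
ORDER=2
CORPUS=news-commentary-v8.fr-en.sb.en
OUT=news-commentary-v8.fr-en.2gram.lm.en   # hypothetical output name

# Same flags as the tutorial command, plus the order option.
CMD="$HOME/irstlm/bin/build-lm.sh -i $CORPUS -n $ORDER -t ./tmp -p -s improved-kneser-ney -o $OUT"

# Print the command for review; it is executed only when DRY_RUN=0.
# (Unquoted expansion splits on spaces, so paths must not contain any.)
echo "$CMD"
if [ "${DRY_RUN:-1}" = "0" ]; then
  $CMD
fi
```

Run as is, the script only prints the assembled command; setting DRY_RUN=0 in the environment executes it. Changing ORDER back to 3 should reproduce the tutorial's behaviour under the same assumption.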
------------------------------
Message: 2
Date: Thu, 19 Jun 2014 15:50:59 +0430
From: alireza tabebordbar <ar.tabebordbar@gmail.com>
Subject: [Moses-support] Help about Using Giza++ for comparable
Corpora
To: Moses-support@mit.edu
Message-ID:
<CAAECi_FBOgVWcOOasTt8ap40hLrKqRyPUsDp_CeZ83+L5_OXjw@mail.gmail.com>
Content-Type: text/plain; charset=ISO-8859-1
Hi all,
I am a Master's student and I'm new to SMT.
I have extracted some comparable sentences and I want to use Giza++ for
word alignment. I think Giza++ uses the EM algorithm to align words, but
I don't know whether I can feed comparable sentences directly to Giza++
or whether I have to combine them with some parallel sentences.
------------------------------
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support
End of Moses-support Digest, Vol 92, Issue 37
*********************************************