Moses-support Digest, Vol 89, Issue 56

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."


Today's Topics:

1. Fwd: ICON-2014: 11th Intl Conf on Natural Language Processing
-- First Call for Papers (Vishal Goyal (????? ????))
2. Re: Loading a binary rule table (Massinissa Ahmim)


----------------------------------------------------------------------

Message: 1
Date: Fri, 21 Mar 2014 19:25:59 +0530
From: Vishal Goyal(????? ????) <vishal.pup@gmail.com>
Subject: [Moses-support] Fwd: ICON-2014: 11th Intl Conf on Natural
Language Processing -- First Call for Papers
To: moses-support <moses-support@mit.edu>, "mt-list@eamt.org"
<mt-list@eamt.org>, Ajit Kumar <ajit8671@gmail.com>, corpora
<corpora@uib.no>, Maxim Khalilov <maxkhalilov@gmail.com>, rakesh kumar
<rakesh77kumar@yahoo.com>, Manish Bansal <manish21578@gmail.com>,
Manish Bhambri <mbhambri0101@gmail.com>, Aditi Sharan
<aditisharan@gmail.com>, Girish Nath Jha <girishjha@gmail.com>,
Umrinder Pal <umrinderpal@gmail.com>, Lalit Goyal
<goyal.aqua@gmail.com>, shashi jindal <jindalshashi@yahoo.com>, Rajeev
Puri <rajeev_puri@hotmail.com>, Rupinder Sandhu
<rsandhu_18@yahoo.com>, Shubhnandan Singh Jamwal
<jamwalsnj@gmail.com>, Basant Agarwal <thebasant@gmail.com>, Vimal Kr
Soni <ruvimals@gmail.com>, Gurpreet Lehal <gslehal@gmail.com>,
Gurinder Gosal <gosal.gps@gmail.com>, Gurpreet Josan
<josangurpreet@rediffmail.com>, Kamal Deep Garg
<kamal.1cse@gmail.com>, "rajesh M.Tech. Student"
<rajeshsharma1234@gmail.com>
Message-ID:
<CADAAq5wMvvgL=u25qkx7t=VDanhxffU=ZHK6b170MWZzU6=nFw@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

---------- Forwarded message ----------
From: Lexical Resource Egroup <lr_egroup@mail.iiit.ac.in>
Date: Fri, Mar 21, 2014 at 4:00 PM
Subject: ICON-2014: 11th Intl Conf on Natural Language Processing -- First
Call for Papers
To:


ICON-2014: ELEVENTH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE
PROCESSING
Goa University, Goa, India

December 18-21, 2014

Organized by

NLP Association, India
International Institute of Information Technology, Hyderabad
Goa University, Goa
Linguistic Data Consortium for Indian Languages, CIIL, Mysore


FIRST CALL FOR PAPERS


The Eleventh International Conference on Natural Language Processing
(ICON-2014) will be held at Goa University, Goa, India during December
18-21, 2014. The ICON conference series is a forum for promoting
interaction among researchers in the field of Natural Language Processing
in India and abroad. The main conference is on December 19-20 2014. This
will be preceded by one day of pre-conference tutorials/workshops on
December 18, 2014 and post conference tutorials/workshops on December 21,
2014.

1. TOPICS:

Papers are invited on substantial, original and unpublished research on all
aspects of Natural Language Processing, with a particular focus on Indian
languages, issues, and applications relevant to India. The areas of
interest include, but are not limited to:

Morphology Parsing
Phonology Word Sense Disambiguation
Syntax Machine Translation
Semantics Information Retrieval
Discourse Text Summarization
Pragmatics Question Answering
Statistical Methods Dialog Systems
Knowledge-based Methods Performance Evaluation
Annotated Corpora Speech Corpora
Lexical Resources Speech Recognition
Ontology Speech Synthesis
POS Tagging


The authors have to submit papers under any of the areas mentioned above
and must mark the topic of their paper at the time of submission.

1.1 The conference will also accept papers for the following special
tracks:
- Natural Language Software Engineering
- Linguistic Engineering

1.2 Position Papers
A paper which expresses an opinion or point of view affecting
the field.

2. FORMAT OF SUBMISSION:

Papers in English, not exceeding 10 pages, should be submitted on the
online portal at www.iiit.ac.in/icon2014. Papers should include an abstract
of about 100-200 words. Please see the style file at
www.aclweb.org/downloads/acl-ftp/Styfiles/Proceedings/

BLIND REVIEW:

Papers in electronic form in the PDF format, anonymous for blind review,
should be submitted. Kindly ensure that authors' names and affiliations are
given only on a separate cover sheet. If your paper contains text of
languages other than English, please attach relevant font files along with
your submission.

3. CALL FOR TUTORIALS/WORKSHOPS:

Proposals are invited for pre-conference tutorials/workshops.
Tutorials/Workshops can be of half-day or full-day duration. The proposal
should be presented in the form of a 200-word abstract, one page topical
outline of the content, description of the proposers and their
qualifications relating to the tutorial content.

Workshops on Linguistic aspects of South Asian Languages are also welcome.

Send tutorial/Workshop proposals to the ICON-2014 Secretariat. For further
information, please refer to the Conference URL or contact the ICON-2014
Secretariat.


4. NLP TOOLS CONTEST ON STATISTICAL MACHINE TRANSLATION IN INDIAN LANGUAGES

Machine translation (MT) is the process of encoding the syntactic and
semantic information of a source language text into a target language. In
past two decades, MT has shown very promising results particularly using
Statistical Machine Translation (SMT) especially for English and other
European Languages. However, its effectiveness in translating sentences
within Indian Languages (IL) and between English and Indian languages needs
to be explored further.

The NLP tool contest in ICON 2014 aims to collectively explore the
effectiveness of SMT while translating within ILs and between English and
ILs.

CONTEST:

In the contest, training data will be provided to the contestants. It will
consist of parallel corpus for different ILs and English. The contestants
will have to train their systems on the data. A development corpus will
also be provided to refine and improve their system. The final contest will
be held on November, 2014 with the test data. A workshop will be held as a
part of ICON to allow the short listed candidates to present their
techniques and results. The detail of the language pair will be announced
shortly. We will test translation in both directions for all language pairs.

The detail of the evaluation procedure and the uses policy of additional
resources/tools will be announced shortly.

The contest will have three prizes:
FIRST PRIZE: Rs.10,000/-
SECOND PRIZE: Rs.7,500/-
THIRD PRIZE: Rs.5,000/-


5. STUDENT PAPER COMPETITION IN LANGUAGE TECHNOLOGIES

ICON-2014 announces STUDENT PAPER COMPETITION in two tracks:
Track I : NLP (All areas)
Track II : Linguistics (Morphology, Syntax and Semantics)

Papers may be submitted under the link on the web page. Prizes will be
awarded in each track for up to two papers based on original work carried
out. The prizes are::

FIRST PRIZE: Rs.10,000/-
SECOND PRIZE: Rs.7,500/-
THIRD PRIZE: Rs.5,000/-

The short-listed papers in each track will be invited for presentation in a
special session in the conference. Registration, domestic travel and
subsistence expenses will be provided by the conference organizers for one
author of each paper. Up to two winners will be offered summer fellowships
at major NLP Centres in India. For any clarifications, contact Student
Paper Competition Chair on icon2014@iiit.ac.in.


6. IMPORTANT DATES:

Paper submission deadline Aug 10, 2014
Paper acceptance notification Oct 15, 2014
Camera ready copy submission Oct 31, 2014

Tutorial/Workshop proposals Aug 20, 2014
Tutorial/Workshop acceptance
notification Aug 31, 2014

NLP Tools Contest registration
deadline Aug 7, 2014
Student Paper Competition
submission deadline Aug 15, 2014



7. COMMITTEES:

Advisory Committee Chair
Prof Aravind K Joshi, University of Pennsylvania, USA

Conference General Chair
Rajeev Sangal, IIT BHU, India

Programme Committee
Jyoti Pawar, Goa University, India (Chair)
Dipti Misra Sharma, IIIT Hyderabad, India (Co-Chair)

Tools Contest Chair
Sandipan Dandapat, IIT Guwahati, India

Student Paper Competition Chair
Vishal Goyal, Punjabi University, India

Organizing Chair
Ramdas N. Karmali, Goa University, India


8. CONTACT INFORMATION

ICON-2014 Secretariat
Language Technologies Research Centre
International Institute of Information Technology
Gachibowli, Hyderabad - 500 032, India
Ph: +91-40-6653 1333, 6653 1144 Fax: +91-400-6653 1413
e-mail: icon2014@iiit.ac.in
URL: www.iiit.ac.in/icon2014




--
*Regards,*
Vishal Goyal,
Ph.D., M.Tech., MCA, M.C.S.D.
Assistant Professor(Stage III) and Placement Coordinator,
Department of Computer Science,
Punjabi University Patiala-147002
[*Online Hindi to Punjabi Machine Translation Tool -*
http://h2p.learnpunjabi.org ]
*[Research Cell: An International Journal of Engineering Sciences,
http://ijoes.vidyapublications.com <http://ijoes.vidyapublications.com>]*
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20140321/ad5de21c/attachment-0001.htm

------------------------------

Message: 2
Date: Sat, 22 Mar 2014 11:10:15 +0100
From: Massinissa Ahmim <massinissa.ahmim@linguacustodia.com>
Subject: Re: [Moses-support] Loading a binary rule table
To: Hieu Hoang <Hieu.Hoang@ed.ac.uk>
Cc: "moses-support@mit.edu" <moses-support@mit.edu>
Message-ID:
<CANN0mWakXgKZKqgTTJOz7kwmz90-F4AOaDXEE6RtyQOZxwgxsg@mail.gmail.com>
Content-Type: text/plain; charset="iso-8859-1"

Hi Hieu,

I used the same binarisation command as the one stated here-above

I get this error when I try to translate something like : echo 'ceci est un
test' | /mosesdecoder/bin/moses_chart -f moses.ini

Here is my moses.ini :

#########################
### MOSES CONFIG FILE ###
#########################

# input factors
[input-factors]
0

# mapping steps
[mapping]
0 T 0
1 T 1

# translation tables: table type (hierarchical(0), textual (0), binary
(1)), source-factors, target-factors, number of scores, file
# OLD FORMAT is still handled for back-compatibility
# OLD FORMAT translation tables: source-factors, target-factors, number of
scores, file
# OLD FORMAT a binary table type (1) is assumed
[ttable-file]
6 0 0 1 /disque2/Preparation/syntactic/hierarchical_PSCT/rules-table
6 0 0 1 /disque2/Preparation/syntactic/hierarchical_PSCT/model/glue-grammar

# no generation models, no generation-file section

# language models: type(srilm/irstlm), factors, order, file
[lmodel-file]
0 0 5 /disque2/Preparation/syntactic/hierarchical/LM.sur.en.blm


# limit on how many phrase translations e for each phrase f are loaded
# 0 = all elements loaded
[ttable-limit]
20


# language model weights
[weight-l]
0.5000


# translation model weights
[weight-t]
0.20
0.20
0.20
0.20
0.20
1.0

# no generation models, no weight-generation section

# word penalty
[weight-w]
-1

[cube-pruning-pop-limit]
1000

[non-terminals]
X

[search-algorithm]
3

[inputtype]
3

[max-chart-span]
20
1000



2014-03-21 20:49 GMT+01:00 Hieu Hoang <Hieu.Hoang@ed.ac.uk>:

> i've never seen that error before. Did you change anything else, other
> than rebinarizing the phrase-table? What is the exact command you ran when
> you got this error? Can I have a look at your moses.ini file
>
>
> On 21 March 2014 18:32, Massinissa Ahmim <
> massinissa.ahmim@linguacustodia.com> wrote:
>
>> Thank you Hieu,
>>
>> I've binarised my rule-table again and it seems to be successfully loaded
>> but I'm getting this message now :
>>
>> Reading /disque2/Preparation/syntactic/hierarchical_PSCT/rules-table
>>
>> ----5---10---15---20---25---30---35---40---45---50---55---60---65---70---75---80---85---90---95--100
>>
>> ****************************************************************************************************
>> Exception: <fd:3>: Is a directory from zlib
>>
>> What can I do?
>>
>> Thanks !
>>
>> Regards
>>
>> Massinissa
>>
>>
>> 2014-03-21 11:18 GMT+01:00 Hieu Hoang <Hieu.Hoang@ed.ac.uk>:
>>
>> everything seems to be ok. Are you sure the server you are running on has
>>> access to the directory
>>> /disque2/Preparation/syntactic/hierarchical_PSCT/rule-table
>>>
>>> Also, you should consider updating your version of Moses to the current
>>> github version, or version 2.1.1
>>>
>>>
>>> On 20 March 2014 11:20, Massinissa Ahmim <
>>> massinissa.ahmim@linguacustodia.com> wrote:
>>>
>>>> Dear all,
>>>>
>>>> I've managed to train a hierarchical model using the following command :
>>>>
>>>> nohup /mosesdecoder/scripts/training/train-model.perl --hierarchical
>>>> --glue-grammar --score-options="--GoodTuring" -root-dir
>>>> /disque2/Preparation/syntactic/hierarchical_PSCT -corpus
>>>> /disque2/Preparation/backoff/PSCT.tok.uni.low -f fr -e en -lm
>>>> 0:5:/disque2/Preparation/syntactic/hierarchical/LM.sur.en.blm
>>>> -external-bin-dir /root/external-bin-dir/ -mgiza -mgiza-cpus 30 >&
>>>> training.out &
>>>>
>>>> The resulting model works great but it is very slow.
>>>>
>>>> So I used CreatOnDiskPt to binarise the rule table as follows :
>>>>
>>>> /home/Moses/mosesdecoder/bin/CreateOnDiskPt 1 1 5 20 2
>>>> model/rule-table.gz rules-table
>>>>
>>>> This outputs the following files
>>>>
>>>> -rw-r--r-- 1 root root 85 20 mars 11:35 Misc.dat
>>>> -rw-r--r-- 1 root root 781509322 20 mars 11:35 Source.dat
>>>> -rw-r--r-- 1 root root 1758588137 20 mars 11:35 TargetColl.dat
>>>> -rw-r--r-- 1 root root 1555353459 20 mars 11:35 TargetInd.dat
>>>> -rw-r--r-- 1 root root 400714 20 mars 11:35 Vocab.dat
>>>>
>>>> and I've updated my moses.ini as follows :
>>>>
>>>> 6 0 0 1 /disque2/Preparation/syntactic/hierarchical_PSCT/rule-table
>>>> 6 0 0 1
>>>> /disque2/Preparation/syntactic/hierarchical_PSCT/model/glue-grammar
>>>>
>>>> But as I try to use it, I'm getting this :
>>>>
>>>> Defined parameters (per moses.ini or switch):
>>>> config: moses.ini
>>>> cube-pruning-pop-limit: 1000
>>>> input-factors: 0
>>>> inputtype: 3
>>>> lmodel-file: 0 0 5
>>>> /disque2/Preparation/syntactic/hierarchical/LM.sur.en.blm
>>>> mapping: 0 T 0 1 T 1
>>>> max-chart-span: 20 1000
>>>> non-terminals: X
>>>> search-algorithm: 3
>>>> ttable-file: 6 0 0 1
>>>> /disque2/Preparation/syntactic/hierarchical_PSCT/rule-table 6 0 0 1
>>>> /disque2/Preparation/syntactic/hierarchical_PSCT/model/glue-grammar
>>>> ttable-limit: 20
>>>> weight-l: 0.5000
>>>> weight-t: 0.20 0.20 0.20 0.20 0.20 1.0
>>>> weight-w: -1
>>>> /mosesdecoder/bin
>>>> ScoreProducer: WordPenalty start: 0 end: 1
>>>> ScoreProducer: !UnknownWordPenalty start: 1 end: 2
>>>> Loading lexical distortion models...have 0 models
>>>> Start loading LanguageModel
>>>> /disque2/Preparation/syntactic/hierarchical/LM.sur.en.blm : [0.000] seconds
>>>> /disque2/Preparation/syntactic/hierarchical/LM.sur.en.blm: line 80492:
>>>> reached EOF before \end\
>>>> ScoreProducer: LM start: 2 end: 3
>>>> Finished loading LanguageModels : [0.056] seconds
>>>> Using uniform ttable-limit of 20 for all translation tables.
>>>> Start loading PhraseTable
>>>> /disque2/Preparation/syntactic/hierarchical_PSCT/rule-table : [0.056]
>>>> seconds
>>>> filePath: /disque2/Preparation/syntactic/hierarchical_PSCT/rule-table
>>>> ScoreProducer: PhraseModel start: 3 end: 4
>>>> Start loading PhraseTable
>>>> /disque2/Preparation/syntactic/hierarchical_PSCT/model/glue-grammar :
>>>> [0.056] seconds
>>>> filePath:
>>>> /disque2/Preparation/syntactic/hierarchical_PSCT/model/glue-grammar
>>>> ScoreProducer: PhraseModel:2 start: 4 end: 5
>>>> Finished loading phrase tables : [0.056] seconds
>>>> max-chart-span: 20
>>>> max-chart-span: 1000
>>>> Start loading phrase table from
>>>> /disque2/Preparation/syntactic/hierarchical_PSCT/rule-table : [0.056]
>>>> seconds
>>>> Can't read /disque2/Preparation/syntactic/hierarchical_PSCT/rule-table
>>>>
>>>> So I wondering if I did something wrong with my training
>>>> command/binarisation or with the parameters in the moses.ini
>>>>
>>>>
>>>> Many thanks
>>>>
>>>> Regads
>>>>
>>>> MA
>>>>
>>>>
>>>>
>>>>
>>>> --
>>>>
>>>> [image: Description : Description : lingua_custodia_final full logo]
>>>>
>>>> *The Translation Trustee*
>>>>
>>>> *1, Place Charles de Gaulle*
>>>>
>>>> *78180 Montigny-le-Bretonneux*
>>>>
>>>> *Tel : +33 1 30 44 04 23 Mobile : +33 7 61 44 40 84*
>>>>
>>>> *Email :* *massinissa.ahmim@linguacustodia.com
>>>> <massinissa.ahmim@linguacustodia.com>*
>>>>
>>>> *Website :* *www.linguacustodia.com
>>>> <http://www.linguacustodia.com/> - www.thetranslationtrustee.com
>>>> <http://www.thetranslationtrustee.com>*
>>>>
>>>> ? Pensez ? l'environnement, n'imprimez ce courriel que si n?cessaire.
>>>>
>>>> Please do not print this email unless it is absolutely necessary. Spread
>>>> environmental awareness.
>>>>
>>>> _______________________________________________
>>>> Moses-support mailing list
>>>> Moses-support@mit.edu
>>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>>>
>>>>
>>>
>>>
>>> --
>>> Hieu Hoang
>>> Research Associate
>>> University of Edinburgh
>>> http://www.hoang.co.uk/hieu
>>>
>>>
>>
>>
>> --
>>
>> [image: Description : Description : lingua_custodia_final full logo]
>>
>> *The Translation Trustee*
>>
>> *1, Place Charles de Gaulle*
>>
>> *78180 Montigny-le-Bretonneux*
>>
>> *Tel : +33 1 30 44 04 23 Mobile : +33 7 61 44 40 84*
>>
>> *Email :* *massinissa.ahmim@linguacustodia.com
>> <massinissa.ahmim@linguacustodia.com>*
>>
>> *Website :* *www.linguacustodia.com <http://www.linguacustodia.com/> -
>> www.thetranslationtrustee.com <http://www.thetranslationtrustee.com>*
>>
>> ? Pensez ? l'environnement, n'imprimez ce courriel que si n?cessaire.
>>
>> Please do not print this email unless it is absolutely necessary. Spread
>> environmental awareness.
>>
>
>
>
> --
> Hieu Hoang
> Research Associate
> University of Edinburgh
> http://www.hoang.co.uk/hieu
>
>


--

[image: Description : Description : lingua_custodia_final full logo]

*The Translation Trustee*

*1, Place Charles de Gaulle*

*78180 Montigny-le-Bretonneux*

*Tel : +33 1 30 44 04 23 Mobile : +33 7 61 44 40 84*

*Email :* *massinissa.ahmim@linguacustodia.com
<massinissa.ahmim@linguacustodia.com>*

*Website :* *www.linguacustodia.com <http://www.linguacustodia.com/> -
www.thetranslationtrustee.com <http://www.thetranslationtrustee.com>*

? Pensez ? l'environnement, n'imprimez ce courriel que si n?cessaire.

Please do not print this email unless it is absolutely necessary. Spread
environmental awareness.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20140322/9a9bf447/attachment.htm
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image001.jpg
Type: image/jpeg
Size: 4421 bytes
Desc: not available
Url : http://mailman.mit.edu/mailman/private/moses-support/attachments/20140322/9a9bf447/attachment.jpg

------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


End of Moses-support Digest, Vol 89, Issue 56
*********************************************

0 Response to "Moses-support Digest, Vol 89, Issue 56"

Post a Comment