Moses-support Digest, Vol 84, Issue 13

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."


Today's Topics:

1. Our Kickstarter Project - an unique game that uses NLP at its
core (dev@northsideinc.com)
2. Postdoc positions in machine translation (Ant?nio Branco)
3. Phrase table format changed? (Petra Wolf)
4. Re: Phrase table format changed? (Rico Sennrich)
5. Re: Syntax Annotation Error Using "parse-en-collins.perl"
(Hieu Hoang)


----------------------------------------------------------------------

Message: 1
Date: Mon, 07 Oct 2013 14:55:33 -0700
From: <dev@northsideinc.com>
Subject: [Moses-support] Our Kickstarter Project - an unique game that
uses NLP at its core
To: moses-support@mit.edu
Message-ID:
<20131007145533.bb1c2b8baad8edd7cf1fdceb5f2355a2.f9db29364b.wbe@email02.secureserver.net>

Content-Type: text/plain; charset="us-ascii"

An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20131007/26ea96fc/attachment-0001.htm

------------------------------

Message: 2
Date: Tue, 08 Oct 2013 08:56:28 +0100
From: Ant?nio Branco <Antonio.Branco@di.fc.ul.pt>
Subject: [Moses-support] Postdoc positions in machine translation
To: moses-support@mit.edu
Message-ID: <5253BAAC.2030903@di.fc.ul.pt>
Content-Type: text/plain; charset=UTF-8; format=flowed





(Apologies for cross-posting.)



Postdoc Positions in Machine Translation
at the Department of Informatics, University of Lisbon, Portugal



Applications are invited for 3-year Postdoc positions
in the area of Machine Translation. The positions are
funded through a new FP7 project, in the continuation of
the QTLaunchPad project, concerned with the exploitation of
novel deep language processing and linked open data approaches
to machine translation.

The research will be conducted at the University of Lisbon
with opportunities for collaborative work with other European
universities and industrial partners of the project, located
in Berlin, Groningen, Sofia, Prague and San Sebastian.

The application period will be open until the positions are filled.
with the starting dates for the positions occurring since
the beginning of November 2013.

The net monthly salary is in the range of 2060-2650 euro,
with 12 payments per year. Insurance and social security
contributions are ensured by the project.


Research Description

Successful candidate will be based in the Department of Informatics
of the University of Lisbon. The Department of Informatics consists
of more than 40 faculty members and over 50 PhD students.
Members of the department are actively pursuing a variety of research
initiatives, including machine translation, natural language
processing, human computer interaction, social network analysis,
bioinformatics, multi-agent systems, etc


Requirements

Applicants must have a PhD in computer science, artificial intelligence,
computational linguistics or a closely related area, with focus
on machine translation. In addition, applicants should have a solid track
record in publications for natural language processing; solid implementation
skills in C++, Java, scripting languages; knowledge of both statistical
and rule-based methods for natural language processing.

We are looking for candidates that have good communication skills,
both oral and written, enjoy to work in a closely collaborating team
in view of executing an exciting work plan of a multi-year, multi-partner
international project.


How to apply:

Please send a cover letter, a curriculum vitae (including a list
of publications), and the names and contact information of at least
two referees to Ant?nio Branco <Antonio.Branco@di.fc.lu.pt>.












------------------------------

Message: 3
Date: Tue, 8 Oct 2013 10:11:57 +0000
From: Petra Wolf <petra.wolf@lucysoftware.com>
Subject: [Moses-support] Phrase table format changed?
To: "moses-support@mit.edu" <moses-support@mit.edu>
Message-ID:
<12A5559653432E41AF377B7F3631BBCC0354EC73@MUC-EX.lucysoftware.com>
Content-Type: text/plain; charset="us-ascii"

Hello,

I have a question regarding the phrase table format.
I created a phrase table with the release 1.0 of Moses and I noticed that there is now a third digit on the frequency section which was not there in former versions. What does this third digit mean?

Some examples:
, Advanced , Advanced Edition Unlimited ||| , Advanced , Advanced Edition Unlimited ||| 0.5 0.100328 1 0.0238182 2.718 ||| 0-0 1-1 2-2 3-3 4-4 5-5 ||| 4 2 2
, Advanced , Advanced Edition ||| , Advanced , Advanced Edition ||| 1 0.13723 1 0.0459938 2.718 ||| 0-0 1-1 2-2 3-3 4-4 ||| 2 2 2
Thanks in advance,
Petra
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20131008/0816b3ec/attachment-0001.htm

------------------------------

Message: 4
Date: Tue, 8 Oct 2013 10:23:47 +0000 (UTC)
From: Rico Sennrich <rico.sennrich@gmx.ch>
Subject: Re: [Moses-support] Phrase table format changed?
To: moses-support@mit.edu
Message-ID: <loom.20131008T122015-555@post.gmane.org>
Content-Type: text/plain; charset=utf-8

Petra Wolf <petra.wolf@...> writes:

>
>
> Hello,
>
> ?
> I have a question regarding the phrase table format.
>
>
> I created
> a phrase table with the release 1.0?of Moses and I noticed that there is
now a third digit on the frequency section which was not there in former
versions. What does this third digit mean?
>
>
>
> ?
> Some examples:
> , Advanced , Advanced Edition Unlimited ||| , Advanced , Advanced Edition
Unlimited ||| 0.5 0.100328 1 0.0238182 2.718 ||| 0-0 1-1 2-2 3-3 4-4 5-5 |||
4 2 2
> , Advanced , Advanced Edition ||| , Advanced , Advanced Edition ||| 1
0.13723 1 0.0459938 2.718 ||| 0-0 1-1 2-2 3-3 4-4 ||| 2 2 2
>
> Thanks in advance,
> Petra
>

Hi Petra,

the old version had c(e) and c(f) in the fifth column, and you could / had
to obtain c(e,f) by computing c(e)*p(f|e). The new version has c(e), c(f)
and c(e,f) in the fifth column.

best wishes,
Rico





------------------------------

Message: 5
Date: Tue, 8 Oct 2013 11:34:46 +0100
From: Hieu Hoang <Hieu.Hoang@ed.ac.uk>
Subject: Re: [Moses-support] Syntax Annotation Error Using
"parse-en-collins.perl"
To: Rajen Chatterjee <rajen.k.chatterjee@gmail.com>
Cc: "moses-support@mit.edu" <moses-support@mit.edu>
Message-ID:
<CAEKMkbiPF9Sa514oahw1+QMXPOLNm-vgjtZV5wTt9EorifwFOA@mail.gmail.com>
Content-Type: text/plain; charset="iso-8859-1"

Most of the script work by taking input from standard in and returning
output to standard out. ie.
./parse-en-collins.perl < input.file
if you want to parse just 1 sentence:
echo "sentence blah blah" | ./parse-en-collins.perl
Reading through the source code for the script, you have to give it the
path of the collins parser, and a program to mxpost.

Unfortunately there no documentation for this script. Once you figure out
how to use it, it would be good if you can write something about how to use
it.


On 7 October 2013 07:22, Rajen Chatterjee <rajen.k.chatterjee@gmail.com>wrote:

> HI,
> Thanks but I want to know the usage of "parse-en-collins.perl"
> I have tried 2 ways
> 1) ./parse-en-collins.perl ./test.txt (where test.txt
> contains this line "I am going to school .")
> 2) ./parse-en-collins.perl "I am going to school ."
>
> In both of this usage I get this o/p:
>
>
> ----------------------------------------------------------------------------------------------------------------------------------------------------
> Read 11692 items from /home/rajen/Public/jmx/tagger.project/word.voc
> Read 45 items from /home/rajen/Public/jmx/tagger.project/tag.voc
> Read 42680 items from
> /home/rajen/Public/jmx/tagger.project/tagfeatures.contexts
> Read 42680 contexts, 117558 numFeatures from
> /home/rajen/Public/jmx/tagger.project/tagfeatures.fmap
> Read model /home/rajen/Public/jmx/tagger.project/model :
> numPredictions=45, numParams=117558
> Read tagdict from /home/rajen/Public/jmx/tagger.project/tagdict
> *This is MXPOST (Version 1.0)*
> *Copyright (c) 1997 Adwait Ratnaparkhi*
>
> ----------------------------------------------------------------------------------------------------------------------------------------------------
>
> The program halts after this. Why is it halting?
>
> On Sun, Oct 6, 2013 at 4:03 AM, Philipp Koehn <pkoehn@inf.ed.ac.uk> wrote:
> > Hi,
> >
> > the wrapper script calls MXPOST and Collins parser internally, so it
> takes
> > raw tokenized text, not already output from the parser.
> >
> > If you already have parsed data, then you will need to take the script
> > apart and use only the conversion part.
> >
> > -phi
> >
> >
> > On Thu, Oct 3, 2013 at 11:22 AM, Rajen Chatterjee
> > <rajen.k.chatterjee@gmail.com> wrote:
> >>
> >> Hi,
> >> I want to do syntax annotation for which I have used Collins
> >> Parser, here is the o/p of Collins Parser:
> >> PROB 756 -35.6989 0
> >> TOP -35.6989 S -33.1933 NP -1.15926 NPB -0.869919 NNP 0 Pilgrimage
> >> VP -28.0845 VBZ 0 is
> >> PP -21.0829 IN 0 of
> >> NP -12.3089 NPB -4.28776 JJ 0 utmost
> >> NN 0 importance
> >> PP -4.72756 IN 0 in
> >> NP -1.15926 NPB -0.869919 NNP 0 Hinduism
> >> (TOP~is~1~1 (S~is~2~2 (NPB~Pilgrimage~1~1 Pilgrimage/NNP ) (VP~is~2~1
> >> is/VBZ (PP~of~2~1 of/IN (NP~importance~2~1 (NPB~importance~2~2
> >> utmost/JJ importance/NN ) (PP~in~2~1 in/IN (NPB~Hinduism~1~1
> >> Hinduism/NNP ./PUNC. ) ) ) ) ) ) )
> >> TIME 0
> >>
> >>
> >> Now I want to convert it to Moses format for which I am using
> >> parse-en-collins.perl, but I don't know the usage of this script. It
> >> is said that the script "parse-en-collins.perl" takes Collins Parser
> >> output and convert to Moses format but if I pass the above o/p it give
> >> me syntax error. So can I know the usage with a small example.
> >>
> >> Thanks
> >>
> >> --
> >> -Regards,
> >> Rajen Chatterjee.
> >> _______________________________________________
> >> Moses-support mailing list
> >> Moses-support@mit.edu
> >> http://mailman.mit.edu/mailman/listinfo/moses-support
> >
> >
>
>
>
> --
> -Regards,
> Rajen Chatterjee.
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>



--
Hieu Hoang
Research Associate
University of Edinburgh
http://www.hoang.co.uk/hieu
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20131008/df2afef6/attachment.htm

------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


End of Moses-support Digest, Vol 84, Issue 13
*********************************************

0 Response to "Moses-support Digest, Vol 84, Issue 13"

Post a Comment