Andrea Gesmundo

Biography

I am a PhD student in Computer Science working on the COMTIS Project at the Université de Genève and a member of the Computational Learning and Computational Linguistics Research Group (CLCL). I am advised by Paola Merlo and James Henderson. My research interests include the application of Machine Learning techniques, such as Bayesian models, HMMs, and the Perceptron algorithm, to a wide range of Statistical Natural Language Processing tasks, such as Syntactic Parsing, Semantic Role Labeling, Part-of-Speech Tagging, Named Entity Recognition, and Shallow Parsing, and more recently Machine Translation. You can see my CV here.

Publications

Heuristic Cube Pruning in Linear Time (previously known as: Approximate Cube Pruning in Linear Time)
Andrea Gesmundo, Giorgio Satta, James Henderson. ACL (2012)
bibtex

Lemmatisation as a Tagging Task
Andrea Gesmundo, Tanja Samardzic. ACL (2012)
NEW! Try this model Online: Online Lemmatiser
Code: BTagger@github
bibtex

HadoopPerceptron: a Toolkit for Distributed Perceptron Training and Prediction with MapReduce
Andrea Gesmundo, Nadi Tomeh. EACL (2012), Demo Session
NEW! version 2.0 released
Code: HadoopPerceptron@github
bibtex    (57% acceptance rate)

Heuristic Search for Non-Bottom-Up Tree Structure Prediction
Andrea Gesmundo, James Henderson. EMNLP (2011)
poster    bibtex    (24% acceptance rate)

Bidirectional Sequence Classification for Tagging Tasks with Guided Learning
Andrea Gesmundo. TALN (2011)
Code: BTagger@github
Try it on MLcomp: mlcomp.org/programs/966
bibtex

Faster Cube Pruning
Andrea Gesmundo, James Henderson. IWSLT (2010)
Faster Cube Pruning has been integrated into cdec
Code: github.com/redpony/cdec
Cdec documentation and build instructions: cdec-decoder.org
Data: cdec-decoder.org/index.php?title=Cdec_sample_grammar_and_test_set
To switch on Algorithm 2, add the command-line option "-I Fast_cube_pruning"
To switch on Algorithm 3, add the command-line option "-I Fast_cube_pruning_2"
bibtex

Bidirectional Sequence Classification for Part of Speech Tagging
Andrea Gesmundo. EVALITA (2009)
(2nd result out of 11, 1st non-combined system)
Try this model Online: Italian POS Tagger
Code: github.com/agesmundo/BTagger
bibtex

Bidirectional Sequence Classification for Named Entities Recognition
Andrea Gesmundo. EVALITA (2009)
(2nd result out of 12, 1st non-combined system)
Code: github.com/agesmundo/BTagger
bibtex

A Latent Variable Model of Synchronous Syntactic-Semantic Parsing for Multiple Languages
Andrea Gesmundo, James Henderson, Paola Merlo and Ivan Titov. CoNLL (2009)
(3rd result out of 14, 1st result for syntax)
Code: github.com/agesmundo/IDParser
bibtex

More Code & Data

Minimum Bayes-Risk for Hierarchical Machine Translation

Language Model for Italian
Trigrams, from Europarl corpus v6 (~700k sentences).

Online Lemmatizer for Serbian and Croatian

Tutorial slides

Introduction to Sampling and Reversible-Jump Markov Chain Monte Carlo.

Introduction to Bayesian Models and Dirichlet Process.

Master's Thesis

Natural Language Processing with Bidirectional Features.
Andrea Gesmundo. Università di Padova (2007).
Thesis supervisor: Giorgio Satta

Contacts

email:


address:
Battelle A
7, route de Drize
1227 Carouge
SWITZERLAND

phone:
+41 22 37 90136

twitter:
@agesmundo


Google Scholar



