Publications

2017

EMNLP, 2017

ACL (short), 2017

WWW, 2017

TACL, 2017

2016

EECS Department, University of California, Berkeley, 2016

2015

EMNLP, 2015

2013

EMNLP, 2013

ACL (short), 2013

The Astrophysical Journal Supplement Series, 2013

2012

EMNLP, 2012

ACL (short), 2012

2011

CoNLL Shared Task, 2011

2010

Physical Review Letters, 2010

CoLing, 2010

ACL, 2010

2009

ALTA, 2009

The University of Sydney, 2009

Johns Hopkins University, 2009

2008

ALTA, 2008

The Journal of Physical Chemistry B, 2008

Software

CCG to PST

A tool for converting CCG derivations into PTB-style phrase structure trees.

Coreference Error Analysis

A tool for classifying errors in coreference resolution.

One-Endpoint Crossing Graph Parser

A range of tools for one-endpoint crossing graphs: parsing, format conversion, and evaluation.

Parse Error Analysis

A tool for classifying mistakes in the output of parsers.

Data

Adaptive CCG Supertagging Model

A model for the C&C supertagger that gives the same results with smaller beam sizes, enabling faster parsing.

Crowdsourced Paraphrases

Paraphrases collected while conducting experiments on factors influencing crowd performance.

IE/NER from Cybercriminal Forums

Forum posts with annotations of products.

Spine and Arc version of the Penn Treebank

Code to convert the standard Penn Treebank into a version where each word is assigned a spine of non-terminals, and arcs to indicate attachments from one spine to another.

Recent Posts

Papers I’m reading and more

By using a single core model to build a game state representation, which then gives input to both state evaluation and move choice, DeepMind are able to apply reinforcement learning with self-play with no supervision and achieve state-of-the-art performance.

To get context-dependence without recurrence we can use a network that applies attention multiple times over both input and output (as it is generated).

By using a generative model to explain worker annotations, we can more effectively predict the correct label, and which workers are spamming.

Another paper looking at the issue of output symbol sparsity in AMR parsing, though here the solution is to group the consistent but rare symbols (rather than graph fragments, as in last week's paper). This drastically increases neural model performance, but does not reach the level of hybrid systems.

The WikiLinks dataset of text mentions that are hyperlinked to Wikipedia articles provides a nice testing space for named entity disambiguation, and a neural network using attention over local context does reasonably well.

Contact

  • [first letter of my first name + first seven letters of my last name] @umich.edu
  • 2260 Hayward Street, Ann Arbor, MI 48109, USA