Difference between revisions of "Parsing (State of the art)"

Revision as of 12:40, 23 May 2008

Performance measure: PARSEVAL - the evalb program
Training data: sections 2-22 of Wall Street Journal corpus
Testing data: section 23 of Wall Street Journal corpus
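
As a rough illustration of what the PARSEVAL measure counts, the sketch below computes labeled bracketing precision, recall, and F1 for one gold/test tree pair. It is a simplified approximation and not the evalb program itself: it assumes Penn Treebank-style bracketed strings without an empty root label, and it omits evalb's parameter-file behaviour (punctuation handling, label equivalences, sentence-length cutoffs). The function names `parse_tree`, `brackets`, and `parseval` are hypothetical and used only for this example.

```python
from collections import Counter


def parse_tree(s):
    """Parse a bracketed string into nested (label, children) tuples."""
    tokens = s.replace("(", " ( ").replace(")", " ) ").split()
    pos = 0

    def read():
        nonlocal pos
        assert tokens[pos] == "("
        pos += 1
        label = tokens[pos]
        pos += 1
        children = []
        while tokens[pos] != ")":
            if tokens[pos] == "(":
                children.append(read())
            else:
                children.append(tokens[pos])  # a word
                pos += 1
        pos += 1  # consume ")"
        return (label, children)

    return read()


def brackets(tree):
    """Count labeled spans (label, start, end), skipping POS-level nodes."""
    spans = Counter()

    def walk(node, start):
        label, children = node
        end = start
        for child in children:
            end = end + 1 if isinstance(child, str) else walk(child, end)
        # A node whose only child is a word is a preterminal (POS tag);
        # this sketch does not count it as a bracket.
        if not (len(children) == 1 and isinstance(children[0], str)):
            spans[(label, start, end)] += 1
        return end

    walk(tree, 0)
    return spans


def parseval(gold, test):
    """Return (precision, recall, F1) over labeled brackets."""
    g = brackets(parse_tree(gold))
    t = brackets(parse_tree(test))
    matched = sum((g & t).values())  # multiset intersection
    precision = matched / sum(t.values()) if t else 0.0
    recall = matched / sum(g.values()) if g else 0.0
    f1 = 2 * precision * recall / (precision + recall) if matched else 0.0
    return precision, recall, f1


# Toy sentences, made up for illustration (not from the WSJ corpus).
gold = "(S (NP (DT the) (NN dog)) (VP (VBD saw) (NP (DT a) (NN cat))))"
test = "(S (NP (DT the) (NN dog)) (VP (VBD saw) (NP (DT a)) (NP (NN cat))))"
precision, recall, f1 = parseval(gold, test)
```

The figures in the table below are aggregated over all of section 23, so they should come from evalb rather than from a per-sentence sketch like this one.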

Table of results

System name	Short description	Main publications	Software	Results (PARSEVAL)	Comments
Charniak & Johnson's Parser	Lexicalized N-Best PCFG + Discriminative reranking	Charniak and Johnson (2005)	Download	91.4%	also works well on Brown
Self-trained Charniak & Johnson Parser	Above + self-training on ~2 million raw sentences from NANC	McClosky, Charniak, and Johnson (2006)	Download	92.1%	also works well on Brown
Collins' Parser	Lexicalized PCFG	Collins (1999), Bikel (2004)	Dan Bikel's implementation	?	?
Berkeley Parser	Automatically induced PCFG	Petrov et al. (2006), Petrov and Klein (2007)	Berkeley Parser	90.1%	also works well for Chinese and German
Link Grammar	Dependency grammar	Temperley, Sleator, Lafferty, others (1995-2006)	Actively supported project	?	Persian, Arabic, Chinese, German, Russian dictionaries have been developed.

References

Bikel, D. (2004). On The Parameter Space of Generative Lexicalized Statistical Parsing Models. PhD Thesis, Computer and Information Science, University of Pennsylvania.

Collins, M. (1999). Head-driven Statistical Models for Natural Language Parsing. PhD Thesis, Computer and Information Science, University of Pennsylvania.

Charniak, E. and Johnson, M. (2005). Coarse-to-fine n-best parsing and MaxEnt discriminative reranking. Proceedings of the 43rd Annual Meeting of the ACL, pages 173–180, Ann Arbor, June 2005.

McClosky, D., Charniak, E., and Johnson, M. (2006). Effective Self-Training for Parsing. Proceedings of HLT/NAACL 2006, pages 152–159, New York City, USA, June 2006.

Petrov, S., Barrett, L., Thibaux, R., and Klein, D. (2006). Learning accurate, compact, and interpretable tree annotation. Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the ACL, pages 433–440, Sydney.

Petrov, S., and Klein, D. (2007). Improved inference for unlexicalized parsing. Proceedings of NAACL 2007, pages 404–411.

Sleator, D. and Temperley, D. (1993). Parsing English with a Link Grammar. Third International Workshop on Parsing Technologies.

External links

PARSEVAL - the evalb program

@@ Line 16: / Line 16: @@
 ! Comments
 |-
-| Johnson & Charniak's Parser
+| Charniak & Johnson's Parser
-| Lexicalized N-Best PCFG + Discriminative re-reanking
+| Lexicalized N-Best PCFG + Discriminative reranking
 | Johnson and Charniak (2005)
-| [http://www.cog.brown.edu/~mj/Software.htm download]
+| [http://www.cog.brown.edu/~mj/Software.htm Download]
 | 91.4%
-| works well also on Brown
+| also works well on Brown
+|-
+| Self-trained Charniak & Johnson Parser
+| Above + self-training on ~2 million raw sentences from NANC
+| McClosky, Charniak, and Johnson (2006)
+| [http://www.cs.brown.edu/~dmccm/selftraining Download]
+| 92.1%
+| also works well on Brown
 |-
 | Collins' Parser
@@ Line 52: / Line 59: @@
 Collins, M. (1999). ''[http://people.csail.mit.edu/mcollins/papers/thesis.ps Head-driven Statistical Models for Natural Language Parsing]''. PhD Thesis, Computer and Information Science, University of Pennsylvania.
-Johnson, M., and Charniak, E. (2005). [http://acl.ldc.upenn.edu/P/P05/P05-1022.pdf Coarse-to-fine n-best parsing and MaxEnt discriminative reranking]. ''Proceedings of the 43rd Annual Meeting of the ACL'', pages 173–180, Ann Arbor, June 2005.
+Charniak, E. and Johnson, M. (2005). [http://acl.ldc.upenn.edu/P/P05/P05-1022.pdf Coarse-to-fine n-best parsing and MaxEnt discriminative reranking]. ''Proceedings of the 43rd Annual Meeting of the ACL'', pages 173–180, Ann Arbor, June 2005.
+McClosky, D., Charniak, E., and Johnson, M. (2006) [http://www.aclweb.org/anthology/N/N06/N06-1020.pdf Effective Self-Training for Parsing]. ''Proceedings of HLT/NAACL 2006'', pages 152-159, New York City, USA, June 2006.
 Petrov, S., Barrett, L., Thibaux, R., and Klein, D. (2006). [http://www.eecs.berkeley.edu/~petrov/data/acl06.pdf Learning accurate, compact, and interpretable tree annotation]. ''Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the ACL'', pages 433–440, Sydney.
