Yi Yang (杨忆)

About Me

    Oct 2018 - present

    Jan 2017 - Sep 2018

    Aug 2011 - Dec 2016


    Contact



    Research

  • Director of NLP, ASAPP

  • Senior Research Scientist, Bloomberg LP

  • Ph.D. in Computer Science, Georgia Tech
    Advisor: Jacob Eisenstein

  • yiyangnlp at gmail dot com


  • Natural language processing (NLP) and machine learning. I am particularly interested in working with nonstandard languages (e.g., social media text) and building NLP systems that require global reasoning (structured prediction).

Journal Papers

  • Overcoming Language Variation in Sentiment Analysis with Social Attention.
    Yi Yang and Jacob Eisenstein.
    Transactions of the Association for Computational Linguistics (TACL), 2017 (Presented at ACL 2017).
    [PDF] [Code] [Slides: Keynote, PDF]

Conference Papers

  • Fast and Accurate Factual Inconsistency Detection Over Long Documents.
    Barrett Martin Lattimer, Patrick Chen, Xinyuan Zhang, and Yi Yang.
    In Processings of Empirical Methods in Natural Language Processing (EMNLP), 2023.
    [PDF]

  • CLIP: A Dataset for Extracting Action Items for Physicians from Hospital Discharge Notes.
    James Mullenbach, Yada Pruksachatkun, Sean Adler, Jennifer Seale, Jordan Swartz, Greg McKelvey, Hui Dai, Yi Yang, and David Sontag.
    In Proceedings of the Association for Computational Linguistics (ACL), 2021.

  • Action-Based Conversations Dataset: A Corpus for Building More In-Depth Task-Oriented Dialogue Systems.
    Derek Chen, Howard Chen, Yi Yang, Alexander Lin, and Zhou Yu.
    In Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL), 2021.

  • Simple and Effective Few-Shot Named Entity Recognition with Structured Nearest Neighbor Learning.
    Yi Yang and Arzoo Katiyar.
    In Processings of Empirical Methods in Natural Language Processing (EMNLP), 2020.
    [PDF] [Code]

  • One Model to Recognize Them All: Marginal Distillation from NER Models with Different Tag Sets.
    Keunwoo Peter Yu and Yi Yang.
    ArXiv, 2020.
    [PDF]

  • Dialog Intent Induction with Deep Multi-View Clustering.
    Hugh Perkins and Yi Yang.
    In Processings of Empirical Methods in Natural Language Processing (EMNLP), 2019.
    [PDF] [Code]

  • Syntax-Infused Variational Autoencoder for Text Generation.
    Xinyuan Zhang, Yi Yang, Siyang Yuan, Dinghan Shen, and Lawrence Carin.
    In Proceedings of the Association for Computational Linguistics (ACL), 2019.
    [PDF]

  • A Semi-Markov Structured Support Vector Machine Model for High-Precision Named Entity Recognition.
    Ravneet Arora, Chen-Tse Tsai, Ketevan Tsereteli, Prabhanjan Kambadur, and Yi Yang.
    In Proceedings of the Association for Computational Linguistics (ACL), 2019.
    [PDF]

  • Convolutional Neural Networks with Recurrent Neural Filters.
    Yi Yang.
    In Proceedings of Empirical Methods in Natural Language Processing (EMNLP), 2018.
    [PDF] [Slides: Keynote, PDF] [Code]

  • Collective Entity Disambiguation with Structured Gradient Tree Boosting.
    Yi Yang, Ozan Irsoy, and Kazi Shefaet Rahman.
    In Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL), 2018.
    [PDF] [Poster] [Code]

  • Toward Socially-Infused Information Extraction: Embedding Authors, Mentions, and Entities.
    Yi Yang, Ming-Wei Chang, and Jacob Eisenstein.
    In Proceedings of Empirical Methods in Natural Language Processing (EMNLP), 2016.
    [PDF] [Poster]

  • Part-of-Speech Tagging for Historical English.
    Yi Yang and Jacob Eisenstein.
    In Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL), 2016.
    [PDF] [Slides: Keynote, PDF] [Talk]

  • WikiQA: A Challenge Dataset for Open-Domain Question Answering.
    Yi Yang, Wen-tau Yih, and Christopher Meek.
    In Proceedings of Empirical Methods in Natural Language Processing (EMNLP), 2015.
    [PDF] [Slides: PPT, PDF] [Data] [Code]

  • S-MART: Novel Tree-based Structured Learning Algorithms Applied to Tweet Entity Linking.
    Yi Yang and Ming-Wei Chang.
    In Proceedings of the Association for Computational Linguistics (ACL), 2015.
    [PDF] [Slides: Keynote, PDF] [Data] [Talk]

  • Unsupervised Multi-Domain Adaptation with Feature Embeddings.
    Yi Yang and Jacob Eisenstein.
    In Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL), 2015.
    [PDF] [Poster] [Code]

  • Fast Easy Unsupervised Domain Adaptation with Marginalized Structured Dropout.
    Yi Yang and Jacob Eisenstein.
    In Proceedings of the Association for Computational Linguistics (ACL), 2014.
    [PDF] [Poster] [Code]

  • A Log-Linear Model for Unsupervised Text Normalization.
    Yi Yang and Jacob Eisenstein.
    In Proceedings of Empirical Methods in Natural Language Processing (EMNLP), 2013.
    [PDF] [Slides] [Code]

  • Quality-biased Ranking of Short Texts in Microblogging Services.
    Minlie Huang, Yi Yang, and Xiaoyan Zhu.
    In Proceedings of the International Joint Conference on Natural Language Processing (IJCNLP), 2011.
    [PDF]

  • Learning to Identify Review Spam.
    Fangtao Li, Minlie Huang, Yi Yang, and Xiaoyan Zhu.
    In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI), 2011.
    [PDF]

Workshop Papers

  • Unsupervised Domain Adaptation with Feature Embeddings.
    Yi Yang and Jacob Eisenstein.
    In proceedings of the International Conference on Learning Representations (Workshop Track; ICLR), 2015.
    [PDF] [Poster] [Code]

  • A Statistical Machine Translation Approach to Entity Recognition.
    Yi Yang and Lambert Mathias.
    In proceedings of Amazon Conference on Machine Learning, 2013.

  • Identifying Protein-protein Interactions in Biomedical Text Articles.
    Rezarta Islamaj Doğan, Yi Yang, Aurélie Névéol, Minlie Huang, and Zhiyong Lu.
    In proceedings of the BioCreative Challenge Evaluation Workshop, 2010.

Teaching

  • CS4464/6465: Computational Journalism
    Instructor: Jacob Eisenstein
    Georgia Tech, Spring 2016.

  • CS 4365/6365: Intro to Enterprise Computing
    Instructor: Calton Pu
    Georgia Tech, Spring 2013.

  • J2EE Architecture
    Instructor: Xueping Shen
    Beihang University, Spring 2011.

  • Java Programming Language
    Instructor: Xueping Shen
    Beihang University, Fall 2010.

  • Algorithm Analysis & Design
    Instructor: You Song
    Beihang University, Fall 2009.

Service

Industry Track Chair

  • NAACL 2024

Area Chair

  • ACL 2018, NAACL 2018

Program Committee Member

  • ACL, EMNLP, NAACL, NeurIPS, ICML, ICLR, EACL, IJCNLP, WNUT, SEM, RepL4NLP, SocialNLP, DeepStruct

Journal Reviewer

  • Transactions of the Association for Computational Linguistics (TACL), Journal of Artificial Intelligence Research (JAIR), Transactions on Information Systems (TOIS), Journal of Biomedical Informatics


  • The webpage template is provided by Nan Du and Yingyu Liang.