I am a postdoctoral researcher at the NSF AI Institute for Student-AI Teaming (iSAT), working with James Martin and the Boulder NLP Group. I obtained my Ph.D. from the School of Computing at the University of Utah, where I worked with Vivek Srikumar and the Utah NLP Group. My research interests center on natural language processing and machine learning. I study methods for predicting structured representations of natural language text, with applications such as semantic parsing and dialogue systems. I interned at WeChat AI and Amazon Lex during the summers of 2018 through 2020.



  • Zhimin Li, Shusen Liu, Xin Yu, Bhavya Kailkhura, Jie Cao, James Daniel Diffenderfer, Peer-Timo Bremer, and Valerio Pascucci. 2022. "Understanding Robustness Lottery": A Comparative Visual Analysis of Neural Network Pruning Approaches. arXiv preprint arXiv:2206.07918.
    • BibTeX
    • Download
  • Debjyoti Paul*, Jie Cao*, Feifei Li, and Vivek Srikumar. 2021. Database workload characterization with query plan encoders. Proceedings of the VLDB Endowment, 15(4):923–935.
    • BibTeX
    • Download
  • Jie Cao and Yi Zhang. 2021. A Comparative Study on Schema-Guided Dialogue State Tracking. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 782–796.
    • BibTeX
    • Download
    • Poster
  • Jie Cao, Yi Zhang, Adel Youssef, and Vivek Srikumar. 2019. Amazon at MRP 2019: Parsing Meaning Representations with Lexical and Phrasal Anchoring. In Proceedings of the Shared Task on Cross-Framework Meaning Representation Parsing at the Conference on Natural Language Learning (CoNLL), pages 138–148.
    • BibTeX
    • Download
  • Jie Cao, Michael Tanana, Zac Imel, Eric Poitras, David Atkins, and Vivek Srikumar. 2019. Observing Dialogue in Therapy: Categorizing and Forecasting Behavioral Codes. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics.
    • BibTeX
    • Download
    • Slides
  • Zhiqiang Liu, Zuohui Fu, Jie Cao, Gerard de Melo, Yik-Cheung Tam, Cheng Niu, and Jie Zhou. 2019. Rhetorically Controlled Encoder-Decoder for Modern Chinese Poetry Generation. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics.
    • BibTeX
    • Download
  • Shuo Sun*, Yik-Cheung Tam*, Jie Cao*, Canxiang Yan, Zuohui Fu, Cheng Niu, and Jie Zhou. 2019. End-to-end Gated Self-attentive Memory Network for Dialog Response Selection. In AAAI DSTC7 Workshop (* equal contribution).
    • BibTeX
    • Download
    • Poster
  • Xijiang Ke, Hai Jin, Xia Xie, and Jie Cao. 2015. A distributed SVM method based on the iterative MapReduce. In Semantic Computing (ICSC), IEEE International Conference on, pages 116–119. IEEE.
    • BibTeX
    • Download
  • Xia Xie, Jie Cao, Hai Jin, Xijiang Ke, and Wenzhi Cao. 2012. JRBridge: A framework of large-scale statistical computing for R. In Services Computing Conference (APSCC), IEEE Asia-Pacific, pages 27–34. IEEE.
    • BibTeX
    • Download

Research Experience

  • [08/2015 - now] Research Assistant at Utah NLP Lab, University of Utah, Salt Lake City
  • [06/2020 - 12/2020] Applied Scientist Intern at AWS AI, Amazon Lex, Remote
    • Our paper on schema-guided dialogue was accepted at NAACL 2021.
  • [06/2019 - 09/2019] Applied Scientist Intern at AWS AI, Amazon Lex, Seattle
    • In the CoNLL shared task MRP 2019, our system for cross-framework meaning representation parsing ranked 1st among 16 teams in the AMR parsing task, 5th in UCCA, and 6th and 7th in the PSD and DM tasks, respectively. Spotlight Talk
  • [05/2018 - 08/2018] Research Intern at Tencent, WeChat AI, Palo Alto
    • Our dialogue system, based on a Gated Attentive Memory Network, ranked in the top 2 in DSTC7 and was accepted at the AAAI 2019 DSTC7 workshop.
  • [09/2008 - 03/2012] Research Assistant at CGCL Lab, Huazhong University of Science and Technology, Wuhan
    • I worked closely with Prof. Xia Xie and Prof. Hai Jin. My research centered on Xen, Xen-ARM virtualization, and distributed computing. We studied how to equip the R language with JVM-based large-scale distributed statistical infrastructure, such as Hadoop and Spark.

Work Experience

  • [10/2014 - 07/2015] Assistant Researcher, SOHU RDC Lab, Beijing
    • Hadoop, Spark, Data migration, Data security, Distributed machine learning
  • [07/2013 - 06/2014] Senior Software Engineer, ZUN CLUB (Startup), Beijing
    • Heterogeneous data integration, Hotel recommendation system.
  • [03/2012 - 06/2013] Software Engineer, Baidu, Beijing
    • Voice Assistant, Mobile Search, Speed optimization, Mobile Anti-Attack
  • [08/2010 - 05/2011] Software Engineer Intern, Alibaba, Hangzhou
    • KV Storage, MySQL, Database Replication, Real-time Computing, Distributed Pub/Sub Data Pipeline.

Teaching & Mentoring

Academic Service

  • PC Member / Reviewer for MRP’2019, ACL’20-22, EMNLP’20-22, NAACL’21, EACL’21, COLING’20, AAAI’19-22, ACL Rolling Review’21-22

Honors and Awards

  • [2019] CoNLL Shared Task, Cross-framework meaning representation parsing, ranked 1st (among 16 teams) in the AMR parsing task.
  • [2018] DSTC7 Track 1, ranked 2nd on both Advising and Ubuntu in subtask 5 (with external knowledge)
  • [2015] Our system ‘Talking Geckos’ won 1st place in a question-answering competition during the Fall 2015 NLP class.
  • [2010] VMware Cloud Computing Innovation Cup, Top 50
  • [2009] Google Android Innovative Idea Sharing Award
  • [2007] “Computer World” Magazine Scholarship (50 students awarded in China)
  • [2007] Microsoft Imagine Cup
    • Algorithm Challenge, Top 50
    • Visual Gaming Contest (Project Hoshimi), Top 2 in China, 18th in the world final.
  • [2006] HUST ACM Programming Contest, Top 3

Curriculum Vitae