news

Nov 30, 2023 Excited to be appointed as an Associate Director of USC’s Center for AI in Society.
Sep 11, 2023 Honored to receive the Intel Rising Stars Award in 2023!
Jun 23, 2023 Taught NLP and Language Models to high-school students via USC Viterbi K-12 Discover Engineering. Slides adapted from Greg Durrett (thanks!).
May 26, 2023 Made an appearance on ABC7 Live where I talked about the impact of AI.
May 09, 2023 New preprint on NeuroComparatives now available!
May 02, 2023 Three papers accepted to ACL 2023: REV and I2D2 at the main conference and COBRA to Findings.
Apr 27, 2023 New preprint on ambiguity in natural language and how LLMs handle it: AMBIENT.
Apr 05, 2023 Invited as a speaker at the USC Sidney Harman Academy for Polymathic Studies for AImaginings, a discussion with Kate Crawford on AI’s potential to memorize and imagine.
Feb 28, 2023 Got featured in a USC Viterbi news article on ChatGPT.
Feb 28, 2023 Gave an invited talk on Designing Controls and Filters for Dataset Generation at Spotify Research Seminar.
Feb 03, 2023 Gave an invited talk on Interpreting datasets to enable better data creation at the Data-Centric AI seminar series at Amazon.
Feb 01, 2023 Gave an invited talk on Contextualizing Bias in Hate Speech Detection at the CAIS++ Seminar.
Jan 20, 2023 Submitted four long papers to ACL along with collaborators from AI2, UW, CMU and Intel.
Dec 10, 2022 Taught a class on Data-Centric Machine Learning at the Online Asian Machine Learning School (OAMLS) at ACML 2022.
Nov 29, 2022 Heading over to EMNLP 2022 in Abu Dhabi next week, hope to meet lots of old and new friends!
Nov 21, 2022 Was interviewed by the MIT Technology Review on dataset quantity and quality: read the coverage here.
Nov 18, 2022 Attended the SoCalNLP Symposium hosted by UCSB, along with the DILL Lab. Gave an invited talk on my research on generating datasets!
Nov 09, 2022 Gave an invited talk at the USC CAIS Seminar on my research on Hate Speech Detection - watch it here!
Nov 08, 2022 Presented an overview of my research to the USC Viterbi CS Department Advisory Board.
Oct 06, 2022 Three papers accepted to EMNLP / Findings: WaNLI, NeuroCounterfactuals and Investigating Free-Form Rationales.
Aug 22, 2022 Started teaching my first class: CSCI 699 - Data-Centric NLP.
Aug 16, 2022 Started as an Assistant Professor of Computer Science at USC Viterbi School of Engineering, where I’m launching the DILL Lab.
Jul 29, 2022 Last day as a Young Investigator at AI2… I’ll miss being here but so excited about USC!
Jul 19, 2022 Super thrilled to receive an outstanding paper award at ICML 2022 for our work on V-info!
Jul 17, 2022 Attending ICML 2022 virtually, where Kawin will be presenting our work on understanding dataset difficulty.
Jul 14, 2022 Had a blast serving as a panelist in the DADC workshop at NAACL. Also our DeepLo 2022 Workshop was quite successful!
Jul 10, 2022 Attending NAACL 2022, where we’ll be presenting Reframing Human-AI Collaboration and Annotators with Attitudes.
Jun 17, 2022 Served as a panelist on the Responsible AI Symposium 2022 for AILA.
May 23, 2022 Honored to have been voted as an invited speaker for ACL STIRS, where I presented my work on datasets!
May 15, 2022 Paper on Dataset Difficulty with my former intern, Kawin Ethayarajh, now accepted at ICML as a long presentation.
May 13, 2022 Had a wonderful time visiting UC Irvine at the CS Seminar; first in-person invited talk for me in a long time!
Apr 15, 2022 We hired a brilliant new incoming cohort of PhD students at USC NLP for Fall 2022! More details soon!
Apr 07, 2022 Two upcoming papers at NAACL: Annotators with Attitudes and Reframing Human-AI Collaboration for Generating Free-Text Explanations.
Mar 16, 2022 Excited to present work on Dataset Construction at IFDS Ethics Seminar and at MilaNLP’s Coding Aperitivo this week.
Jan 15, 2022 New preprint on WaNLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation.
Dec 16, 2021 New preprint on Human-AI Collaboration for Generating Free-Text Explanations with Jack Hessel and our intern, Sarah Wiegreffe.
Nov 30, 2021 🏆 Our paper on MAUVE received an outstanding paper award at NeurIPS 2021, making it one of 6 papers out of 9K submissions!
Nov 21, 2021 The third edition of the DeepLo for NLP workshop will be co-located with NAACL 2022.
Nov 15, 2021 New preprint on Annotators with Attitudes, presented at the Text as Data conference.
Oct 20, 2021 Invited virtual talk at the Machine Learning Research Group at Oracle.
Oct 15, 2021 New preprint on Information-Theoretic Measures of Dataset Difficulty with my intern, Kawin Ethayarajh.
Oct 12, 2021 Attended the Trustworthy NLP Workshop at Google.
Sep 28, 2021 Paper on MAUVE, an Information Divergence Measure between Neural Text and Human Text, accepted as a NeurIPS 2021 oral!
Sep 13, 2021 Paper on Data Augmentation for Frame-SRL, with Miriam R. L. Petruck, to appear at the LAW-DMR workshop at EMNLP 2021.
Sep 07, 2021 Abstract on Biases in Toxic Language Detection: The Role of Annotator Beliefs And Demographics accepted to Text As Data 2021. Full paper coming soon!
Aug 26, 2021 Paper with my intern, Alon Jacovi, on Contrastive Explanations for Model Interpretability to appear at EMNLP 2021.
May 05, 2021 Paper on On-the-Fly Controlled Text Generation with Experts and Anti-Experts to appear at ACL 2021.
Mar 04, 2021 Guest lecture on Transfer Learning at UW Stats: UW DATA 598 Statistical Deep Learning, taught by Zaid Harchaoui.
Mar 01, 2021 Check out our new pre-print on contrastive explanations for model decisions! Work with my intern Alon Jacovi and others!
Feb 24, 2021 Talk at the NERT Seminar at Georgetown University! So honored to be an elected speaker :)
Feb 12, 2021 Invited talk at the NLP Seminar at Georgia Tech!
Feb 03, 2021 Check out our new pre-print on an evaluation metric for open-ended text generation, MAUVE!
Feb 01, 2021 New ACL submission on controlled generation, with exciting applications. Keep an eye out!
Jan 11, 2021 Paper on Challenges in Social Bias Mitigation in Hate Speech Detection to appear at EACL 2021!
Dec 03, 2020 Guest lecture in Eunsol Choi’s Topics in NLP class at UT Austin on Biases and Interpretability.
Nov 02, 2020 Was delighted to be an invited speaker for Responsible AI at the Microsoft E+D Product Leaders Conference.
Sep 15, 2020 Paper on Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics is now accepted to the Proceedings of EMNLP, and G-DAUG is accepted to Findings of EMNLP.
Aug 13, 2020 Completed one year as a postdoctoral investigator at AI2!
Jul 08, 2020 Our paper Don’t Stop Pretraining: Adapt Language Models to Domains and Tasks received an Honorable Mention Award at ACL 2020!
Jun 03, 2020 New submission to EMNLP finally done, shoot me an email if you’d like to learn more.
May 31, 2020 Our paper on Adversarial Filters of Dataset Biases has been accepted to ICML!
Apr 11, 2020 New EMNLP preprint on Generative Data Augmentation for Commonsense Reasoning, or G-DAUG, now available on arXiv.
Apr 03, 2020 Two papers accepted to ACL! Kudos to my wonderful collaborators on Don’t Stop Pretraining: Adapt Language Models to Domains and Tasks and The Right Tool for the Job: Matching Model and Instance Complexities.
Feb 02, 2020 New preprint on Adversarial Filters of Dataset Biases now available on arXiv.
Oct 17, 2019 Invited talk at the UW Linguistics Colloquium on Oct 18, 2019.
Oct 03, 2019 Got selected to attend the Rising Stars in EECS workshop at the University of Illinois at Urbana-Champaign from Oct 29 - Nov 1, 2019.
Sep 25, 2019 Submitted our latest paper to ICLR 2020.
Aug 13, 2019 Joined AI2 as a Postdoctoral Young Investigator.
Jun 02, 2019 Presented our tutorial on Transfer Learning in Natural Language Processing at NAACL 2019 in Minnesota.
May 08, 2019 Defended my PhD thesis!