Sep 11, 2023 |
Honored to receive the Intel Rising Stars Award in 2023!
|
Jun 23, 2023 |
Taught NLP and Language Models to high-school students via USC Viterbi K-12 Discover Engineering. Slides adapted from Greg Durrett (thanks!).
|
May 26, 2023 |
Made an appearance on ABC7 Live where I talked about the impact of AI.
|
May 9, 2023 |
New preprint on NeuroComparatives now available!
|
May 2, 2023 |
Three papers accepted to ACL 2023: REV and I2D2 at the main conference and COBRA to Findings.
|
Apr 27, 2023 |
New preprint on ambiguity in natural language and how LLMs handle it: AMBIENT.
|
Apr 5, 2023 |
Got invited as a speaker at the USC Sidney Harmon Academy for Polymathic Studies for AImaginings, a discussion with Kate Crawford on AI’s potential to memorize and imagine.
|
Feb 28, 2023 |
Got featured in a USC Viterbi news article on ChatGPT.
|
Feb 28, 2023 |
Gave an invited talk on Designing Controls and Filters for Dataset Generation at Spotify Research Seminar.
|
Feb 3, 2023 |
Gave an invited talk on Interpreting datasets to enable better data creation at the Data-Centric AI seminar series at Amazon.
|
Feb 1, 2023 |
Gave an invited talk on Contextualizing Bias in Hate Speech Detection at the CAIS++ Seminar.
|
Jan 20, 2023 |
Submitted four long papers to ACL along with collaborators from AI2, UW, CMU and Intel.
|
Dec 10, 2022 |
Taught a class on Data-Centric Machine Learning at the Online Asian Machine Learning School (OAMLS) at ACML 2022.
|
Nov 29, 2022 |
Heading over to EMNLP 2022 in Abu Dhabi next week, hope to meet lots of old and new friends!
|
Nov 21, 2022 |
Was interviewed by the MIT Technology Review on dataset quantity and quality: read the coverage here.
|
Nov 18, 2022 |
Attended the SoCalNLP Symposium hosted by UCSB, along with the DILL Lab. Gave an invited talk on my research on generating datasets!
|
Nov 9, 2022 |
Gave an invited talk at the USC CAIS Seminar on my research on Hate Speech Detection - watch it here!
|
Nov 8, 2022 |
Presented an overview of my research to the USC Viterbi CS Department Advisory Board.
|
Oct 6, 2022 |
Three papers accepted to EMNLP / Findings: WaNLI, NeuroCounterfactuals and Investigating Free-Form Rationales.
|
Aug 22, 2022 |
Started teaching my first class: CSCI 699 - Data-Centric NLP.
|
Aug 16, 2022 |
Started as an Assistant Professor of Computer Science at USC Viterbi School of Engineering, where I’m launching the DILL Lab.
|
Jul 29, 2022 |
Last day as a Young Investigator at AI2… I’ll miss being here but so excited about USC!
|
Jul 19, 2022 |
Super thrilled to receive an outstanding paper award at ICML 2022 for our work on V-info!
|
Jul 17, 2022 |
Attending ICML 2022 virtually, where Kawin will be presenting our work on understanding dataset difficulty.
|
Jul 14, 2022 |
Had a blast serving as a panelist in the DADC workshop at NAACL. Also our DeepLo 2022 Workshop was quite successful!
|
Jul 10, 2022 |
Attending NAACL 2022, where we’ll be presenting Reframing Human-AI Collaboration and Annotators with Attitudes.
|
Jun 17, 2022 |
Served as a panelist on the Responsible AI Symposium 2022 for AILA.
|
May 23, 2022 |
Honored to have been voted as an invited speaker for ACL STIRS, where I presented my work on datasets!
|
May 15, 2022 |
Paper on Dataset Difficulty with my former intern, Kawin Ethayarajh, now accepted at ICML as a long presentation.
|
May 13, 2022 |
Had a wonderful time visiting UC Irvine at the CS Seminar; first in-person invited talk for me in a long time!
|
Apr 15, 2022 |
We hired a brilliant new incoming cohort of PhD students at USC NLP for Fall 2022! More details soon!
|
Apr 7, 2022 |
Two upcoming papers at NAACL: Annotators with Attitudes and Reframing Human-AI Collaboration for Generating Free-Text Explanations.
|
Mar 16, 2022 |
Excited to present work on Dataset Construction at IFDS Ethics Seminar and at MilaNLP’s Coding Aperitivo this week.
|
Jan 15, 2022 |
New preprint on WaNLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation.
|
Dec 16, 2021 |
New preprint on Human-AI Collaboration for Generating Free-Text Explanations with Jack Hessel and our intern, Sarah Wiegreffe.
|
Nov 30, 2021 |
🏆 Our paper on MAUVE received an outstanding paper award at NeurIPS 2021, making it one of 6 papers out of 9K submissions!
|
Nov 21, 2021 |
The third edition of the DeepLo for NLP workshop will be co-located with NAACL 2022.
|
Nov 15, 2021 |
New preprint on Annotators with Attitudes, presented at the Text as Data conference.
|
Oct 20, 2021 |
Invited virtual talk at the Machine Learning Research Group at Oracle.
|
Oct 15, 2021 |
New preprint on Information-Theoretic Measures of Dataset Difficulty with my intern, Kawin Ethayarajh.
|
Oct 12, 2021 |
Attended the Trustworthy NLP Workshop at Google.
|
Sep 28, 2021 |
Paper on MAUVE, an Information Divergence Measure between Neural Text and Human Text, accepted as a NeurIPS 2021 oral!
|
Sep 13, 2021 |
Paper on Data Augmentation for Frame-SRL, based on a pre-posal with Miriam R. L. Petruck to appear at the LAW-DMR workshop at EMNLP 2021.
|
Sep 7, 2021 |
Abstract on Biases in Toxic Language Detection: The Role of Annotator Beliefs And Demographics accepted to Text As Data 2021. Full paper coming soon!
|
Aug 26, 2021 |
Paper with my intern, Alon Jacovi, on Contrastive Explanations for Model Interpretability to appear at EMNLP 2021.
|
May 5, 2021 |
Paper on On-the-Fly Controlled Text Generation with Experts and Anti-Experts to appear at ACL 2021.
|
Mar 4, 2021 |
Guest lecture on Transfer Learning at UW Stats: UW DATA 598 Statistical Deep Learning, taught by Zaid Harchaoui.
|
Mar 1, 2021 |
Check out our new pre-print on contrastive explanations for model decisions! Work with my intern Alon Jacovi and others!
|
Feb 24, 2021 |
Talk at the NERT Seminar at Georgetown University! So honored to be an elected speaker :)
|
Feb 12, 2021 |
Invited talk at the NLP Seminar at Georgia Tech!
|
Feb 3, 2021 |
Check out our new pre-print on an evaluation metric for open-ended text generation, MAUVE!
|
Feb 1, 2021 |
New ACL submission on controlled generation, with exciting applications. Keep an eye out!
|
Jan 11, 2021 |
Paper on Challenges in Social Bias Mitigation in Hate Speech Detection to appear at EACL 2021!
|
Dec 3, 2020 |
Guest lecture in Eunsol Choi’s Topics in NLP class at UT Austin on Biases and Interpretability.
|
Nov 2, 2020 |
Was delighted to be an invited speaker for Responsible AI at the Microsoft E+D Product Leaders Conference.
|
Sep 15, 2020 |
Paper on Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics is now accepted to the Proceedings of EMNLP, and GDaug is accepted to Findings of EMNLP.
|
Aug 13, 2020 |
Completed one year as a postdoctoral investigator at AI2!
|
Jul 8, 2020 |
Our paper Don’t Stop Pretraining: Adapt Language Models to Domains and Tasks received an Honorable Mention Award at ACL 2020!
|
Jun 3, 2020 |
New submission to EMNLP finally done, shoot me an email if you’d like to learn more.
|
May 31, 2020 |
Our paper on Adversarial Filters of Dataset Biases has been accepted to ICML!
|
Apr 11, 2020 |
New EMNLP preprint on Generative Data Augmentation for Commonsense Reasoning, or G-DAUG, now available on arXiv.
|
Apr 3, 2020 |
Two papers accepted to ACL! Kudos to my wonderful collaborators on Don’t Stop Pretraining: Adapt Language Models to Domains and Tasks and The Right Tool for the Job: Matching Model and Instance Complexities.
|
Feb 2, 2020 |
New preprint on Adversarial Filters of Dataset Biases now available on arXiv.
|
Oct 17, 2019 |
Invited talk at the UW Linguistics Colloquium on Oct 18, 2019.
|
Oct 2, 2019 |
Got selected to attend the Rising Stars in EECS workshop at the University of Illinois at Urbana-Champaign from Oct 29 - Nov 1, 2019.
|
Sep 25, 2019 |
Submitted our latest paper to ICLR 2020.
|
Aug 13, 2019 |
Joined AI2 as a Postdoctoral Young Investigator.
|
Jun 2, 2019 |
Presented our tutorial on Transfer Learning in Natural Language Processing at NAACL 2019 in Minnesota.
|
May 8, 2019 |
Defended my PhD thesis!
|