news | Swabha Swayamdipta

Apr 25, 2025	Honored to receive a sponored research grant by Apple.
Apr 23, 2025	DILL lab has newly minted entrepreneurs: Jaspreet Ranjit and Aryan Gulati are the Min Family Challenge winners in 2025.
Apr 23, 2025	DILL Lab wins two awards at ShowCAIS 2025: best poster by undergrad Risha Surana and runner-up best oral presentation by Jaspreet Ranjit.
Apr 08, 2025	DILL Lab students, Matt Finlayson and Ryan Wang (who’s joining UC Berkeley soon) got the NSF Graduate Research Fellowship this year!
Mar 31, 2025	Co-organizing The Futures of Language Models and Transformers this week with Sasha Rush, as part of the Special Program on LLMs (Part 2).
Jan 14, 2025	I’ll be spending most of spring at the Simons Institute, attending the Special Program on LLMs (Part 2). Come say hi if you are in Berkeley!
Jan 06, 2025	Starting a new 20% role as an Amazon Visiting Academic in AWS Bedrock.
Dec 18, 2024	DILL lab undergrads, Aryan Gulati and Ryan Wang received CRA Outstanding Undergraduate Researcher Awards!
Nov 14, 2024	We won an outstanding paper award at EMNLP 2024 for our work on OATH frames!
Jul 11, 2024	Honored to receive an NSF CISE grant on a Collaborative Research Proposal with Co-PIs Robin Jia, Jordan Boyd-Graber, Alvin Grissom III and John Lalor.
May 08, 2024	Excited to receive the USC Office of Research and Innovation’s Zumberge Preliminary Studies DEI in Research Award with my co-PI Eric Rice.
Apr 22, 2024	Thrilled to be awarded a Research Fellowship at the Simons Institute at UC Berkeley for spring 2025 under the program on “Large Language Models and Transformers, Part II.”
Nov 30, 2023	Excited to be appointed as an Associate Director of USC’s Center for AI in Society.
Sep 11, 2023	Honored to receive the Intel Rising Stars Award in 2023!
Jun 23, 2023	Taught NLP and Language Models to high-school students via USC Viterbi K-12 Discover Engineering. Slides adapted from Greg Durrett (thanks!).
May 26, 2023	Made an appearance on ABC7 Live where I talked about the impact of AI.
May 09, 2023	New preprint on NeuroComparatives now available!
May 02, 2023	Three papers accepted to ACL 2023: REV and I2D2 at the main conference and COBRA to Findings.
Apr 27, 2023	New Preprint on Ambiguity in natural language and how LLMs handle it: AMBIENT.
Apr 05, 2023	Got invited as a speaker at the USC Sidney Harmon Academy for Polymathic Studies for a discussion with Kate Crawford on AImaginings, a discussion on AI’s potential to memorize and imagine.
Feb 28, 2023	Got featured in an USC Viterbi news article on ChatGPT.
Feb 28, 2023	Gave an invited talk on Designing Controls and Filters for Dataset Generation at Spotify Research Seminar.
Feb 03, 2023	Gave an invited talk on Interpreting datasets to enable better data creationat the Data-Centric AI seminar series at Amazon.
Feb 01, 2023	Gave an invited talk on Contextualizing Bias in Hate Speech Detection at the CAIS++ Seminar.
Jan 20, 2023	Submitted four long papers to ACL along with collaborators from AI2, UW, CMU and Intel.
Dec 10, 2022	Taught a class on Data-Centric Machine Learning at the Online Asian Machine Learning School (OAMLS) at ACML 2022.
Nov 29, 2022	Heading over to EMNLP 2022 in Abu Dhabi next week, hope to meet lots of old and new friends!
Nov 21, 2022	Was interviewed by the MIT Technology Review on dataset quantity and quality: read the coverage here.
Nov 18, 2022	Attended the SoCalNLP Symposium hosted by UCSB, along with the DILL Lab. Gave an invited talk on my research on generating datasets!
Nov 09, 2022	Gave an invited talk at the USC CAIS Seminar on my research on Hate Speech Detection - watch it here!
Nov 08, 2022	Presented an overview of my research to the USC Viterbi CS Department Advisory Board.
Oct 06, 2022	Three papers accepted to EMNLP / Findings: WaNLI, NeuroCounterfactuals and Investigating Free-Form Rationales.
Aug 22, 2022	Started teaching my first class: CSCI 699 - Data-Centric NLP.
Aug 16, 2022	Started as an Assistant Professor of Computer Science at USC Viterbi School of Engineering, where I’m launching the DILL Lab.
Jul 29, 2022	Last day as a Young Investigator at AI2… I’ll miss being here but so excited about USC!
Jul 19, 2022	Super thrilled to receive an outstanding paper award at ICML 2022 for our work on V-info!
Jul 17, 2022	Attending ICML 2022 virtually, where Kawin be presenting our work on understanding dataset difficulty.
Jul 14, 2022	Had a blast serving as a panelist in the DADC workshop at NAACL. Also our DeepLo 2022 Workshop was quite successful!
Jul 10, 2022	Attending NAACL 2022, where we’ll be presenting Reframing Human-AI Collaboration and Annotators with Attitudes.
Jun 17, 2022	Served as a panelist on the Responsible AI Symposium 2022 for AILA.
May 23, 2022	Honored to have been voted as an invited speaker for ACL STIRS, where I presented my work on datasets!
May 15, 2022	Paper on Dataset Difficulty with my former intern, Kawin Ethayarajh, now accepted at ICML as a long presentation.
May 13, 2022	Had a wonderful time visiting UC Irvine at the CS Seminar; first in-person invited talk for me in a long time!
Apr 15, 2022	We hired a brilliant new incoming cohort of PhD students at USC NLP for Fall 2022! More details soon!
Apr 07, 2022	Two upcoming papers at NAACL: Annotators with Attitudes and Reframing Human-AI Collaboration for Generating Free-Text Explanations.
Mar 16, 2022	Excited to present work on Dataset Construction at IFDS Ethics Seminar and at MilaNLP’s Coding Aperitivo this week.
Jan 15, 2022	New preprint on WaNLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation.
Dec 16, 2021	New preprint on Human-AI Collaboration for Generating Free-Text Explanations with Jack Hessel and our intern, Sarah Wiegreffe.
Nov 30, 2021	🏆 Our paper on MAUVE received an outstanding paper award at NeurIPS 2021 making it one of 6 papers out of 9K submissions!
Nov 21, 2021	The third edition DeepLo for NLP workshop to be collocated with NAACL 2022.
Nov 15, 2021	New preprint on Annotators with Attitudes, presented at the Text as Data conference.
Oct 20, 2021	Invited virtual talk at the Machine Learning Research Group at Oracle.
Oct 15, 2021	New preprint on Information-Theoretic Measures of Dataset Difficulty with my intern, Kawin Ethayarajh.
Oct 12, 2021	Attended the Trustworthy NLP Workshop at Google.
Sep 28, 2021	Paper on MAUVE, an Information Divergence Measure between Neural Text and Human Text, accepted as a NeurIPS 2021 oral!
Sep 13, 2021	Paper on Data Augmentation for Frame-SRL, with Miriam R. L. Petruck to appear at the LAW-DMR workshop at EMNLP 2021.
Sep 07, 2021	Abstract on Biases in Toxic Language Detection:The Role of Annotator Beliefs And Demographics accepted to Text As Data 2021. Full paper coming soon!
Aug 26, 2021	Paper with my intern, Alon Jacovi, on Contrastive Explanations for Model Interpretability to appear at EMNLP 2021.
May 05, 2021	Paper on On-the-Fly Controlled Text Generation with Experts and Anti-Experts to appear at ACL 2021.
Mar 04, 2021	Guest lecture on Transfer Learning at UW Stats: UW DATA 598 Statistical Deep Learning, taught by Zaid Harachoui.
Mar 01, 2021	Check out our new pre-print on contrastive explanations for model decisions! Work with my intern Alon Jacovi and others!
Feb 24, 2021	Talk at the NERT Seminar at Georgetown University! So honored to be an elected speaker :)
Feb 12, 2021	Invited talk at the NLP Seminar at Georgia Tech!
Feb 03, 2021	Check out our new pre-print on an evaluation metric for open-ended text generation, MAUVE !
Feb 01, 2021	New ACL submission on controlled generation, with exciting applications. Keep an eye out!
Jan 11, 2021	Paper on Challenges in Social Bias Mitigation in Hate Speech Detection to appear at EACL 2021!
Dec 03, 2020	Guest lecture in Eunsol Choi’s Topics in NLP class at UT Austin on Biases and Interpretability.
Nov 02, 2020	Was delighted to be an invited speaker for Responsible AI at the Microsoft E+D Product Leaders Conference.
Sep 15, 2020	Paper on Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics is now accepted to the Proceedings of EMNLP, and GDaug is accepted to Findings of EMNLP.
Aug 13, 2020	Completed one year as a postdoctoral investigator at AI2!
Jul 08, 2020	Our paper Don’t Stop Pretraining: Adapt Language Models to Domains and Tasks received an Honorable Mention Award at ACL 2020!
Jun 03, 2020	New submission to EMNLP finally done, shoot me an email if you’d like to learn more.
May 31, 2020	Our paper on Adversarial Filters of Dataset Biases has been accepted to ICML!
Apr 11, 2020	New EMNLP preprint on Generative Data Augmentation for Commonsense Reasoning, or G-DAUG now available on arXiv.
Apr 03, 2020	Two papers accepted to ACL! Kudos to my wonderful collaborators on Don’t Stop Pretraining: Adapt Language Models to Domains and Tasks and The Right Tool for the Job: Matching Model and Instance Complexities.
Feb 02, 2020	New preprint on Adversarial Filters of Dataset Biases now available on arXiv.
Oct 17, 2019	Invited talk at the UW Linguistics Colloquium on Oct 18, 2019.
Oct 02, 2019	Got selected to attend the Rising Stars in EECS workshop at the University of Illinois at Urbana-Champaign from Oct 29 - Nov 1, 2019.
Sep 25, 2019	Submitted our latest paper to ICLR 2020.
Aug 13, 2019	Joined AI2 as a Postdoctoral Young Investigator.
Jun 02, 2019	Presented our tutorial on Transfer Learning in Natural Language Processing at NAACL 2019 in Minnesota.
May 08, 2019	Defended my PhD thesis!