I'm a PhD student at UC Berkeley School of Information and BAIR, working on natural language processing (NLP) and computational social science. I'm supported by a NSF Graduate Research Fellowship and advised by David Bamman.
I have a M.S. in Computer Science (artificial intelligence) and B.S. in Symbolic Systems (≈ cognitive science) from Stanford. In the past I've worked in the Stanford NLP Group with Dora Demszky, the EPFL Data Science Lab with Robert West, and the SNAP Group with David Jurgens. I taught machine learning to high schoolers with AI4ALL at Stanford and Berkeley, and I organized these Medium posts about NLP pedagogy.
This is my academic CV.
I'm interested in computational sociolinguistics and using text analysis to shed light on social science questions.
How do people talk about other people?
People are people-centric, and much of what they say focuses on others. I use NLP methods to quantify the discussion of individuals and social groups in text.
What's in the long tail of language on the Internet?
Social media is filled with communities that use distinctive language, such as innovative or unique words and meanings. I'm mapping out online linguistic landscapes and looking at how they evolve over time.
I publish with my name backwards, so citations should refer to "L. Lucy".
* = equal contribution
Content Analysis of Textbooks via Natural Language Processing: Findings on Gender, Race, and Ethnicity in Texas U.S. History Textbooks
Li Lucy*, Dora Demszky*, Patricia Bromley, Dan Jurafsky.
AERA Open 2020.
Best Paper Award. Paper. Code. Slides.
I have a small, curated collection of plants.
I designed Snapchat geofilters that have over 26 million views worldwide.
I have been Yelp Elite for four years.
In high school, I interned on the production team for the Emmy-award-winning PBS Kids show, SciGirls.