Homework 4: Applications of Word Embeddings

Due Thursday March 14, 11:59PM EST

Gradescope Colab Notebook Late Day Request Form (accepted up to 24 hours before deadline)
In this homework assignment, we will be exploring exploring word embeddings through the implementation of algorithms from two recent papers.
Learning Goals:
Once you complete this assignment, you should:

Be familiar with some applications of word embeddings
Be able to express one way in which word embeddings can replicate human biases
Have experience reading documentation for a new library to write efficient code
Have experience turning algorithms from NLP papers into code

This assignment is connected to the following overall learning goals of the course:

Be familiar with NLP methods in three key areas: text classification, text generation, and language understanding
Be able to effectively use python libraries that are part of the large ecosystem of tools for NLP
Be exposed to ways in which language technology can perpetuate stereotypes and biases related to minoritized groups, and inequity among different languages
Demonstrate your ability to engage with recent research papers in NLP

Submit these files: HW4.ipynb
Leaderboard: In the leaderboard for this assignment, you will be ranked according to the number of relevant terms in your top 50 list. See details under task 2.4. Your first two submissions without errors each day will count towards the leaderboard.
Credits: This assignment is inspired by papers written by Aylin Caliskan and Hila Gonen.

When you submit to gradescope, your file must be called exactly HW4.ipynb.

This homework assignment differs from homework assignments 1-3 in that you will complete it in a Colab notebook. The instructions are all listed in the notebook!

To submit this homework, you should download your notebook as a .ipynb file and submit that to gradescope (click File » Download » Download .ipynb). That is the only file you’ll need to submit (it will include your report).