This page contains the instructions for the practical assignments. For more information about how the lab module is examined, see the page on Examination.


Lab 0: Introduction to PyTorch

The purpose of this preparatory lab is to introduce you to the basics of PyTorch, the deep learning framework we use for the lab series. Many good introductions to PyTorch are available online, including the 60 Minute Blitz on the official PyTorch website. For this course we have put together a notebook that focuses on the basics that you will encounter in the labs.

Introduction to PyTorch (notebook, live notebook, video)

Lab 1: Word representations

To process words using neural networks, we need to represent them as vectors of numerical values.
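
As a minimal illustration of this idea, the sketch below maps a toy vocabulary to trainable vectors using PyTorch's nn.Embedding; the vocabulary and the embedding dimension are made up for the example.

```python
import torch
import torch.nn as nn

# Toy vocabulary (made up for the example): map each word to an integer index.
vocab = {'the': 0, 'cat': 1, 'sat': 2}

# An embedding layer stores one trainable vector per word in the vocabulary.
embedding = nn.Embedding(num_embeddings=len(vocab), embedding_dim=4)

# Looking up a sentence gives one vector per word.
indices = torch.tensor([vocab[w] for w in ['the', 'cat', 'sat']])
vectors = embedding(indices)
print(vectors.shape)  # torch.Size([3, 4])
```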

Level A

In this lab you will implement the skip-gram model with negative sampling (SGNS) from Lecture 1.4, and use it to train word embeddings on the text of the Simple English Wikipedia.

Lab L1: Word representations (due 2023-01-27)

Additional material: Sampling words by frequency (notebook, live notebook)
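
To give a rough idea of what the SGNS objective looks like in PyTorch, here is a minimal sketch; the class name, its parameters, and the tensor shapes are invented for the example, and the sampling of negative examples (see the additional material above) is assumed to happen outside the model.

```python
import torch.nn as nn
import torch.nn.functional as F

class SGNS(nn.Module):
    """Minimal skip-gram with negative sampling; names and sizes are made up."""

    def __init__(self, vocab_size, embedding_dim=50):
        super().__init__()
        self.target = nn.Embedding(vocab_size, embedding_dim)   # vectors for centre words
        self.context = nn.Embedding(vocab_size, embedding_dim)  # vectors for context words

    def forward(self, targets, contexts, negatives):
        # targets, contexts: (batch,) word ids; negatives: (batch, k) sampled word ids
        t = self.target(targets)                       # (batch, dim)
        c = self.context(contexts)                     # (batch, dim)
        n = self.context(negatives)                    # (batch, k, dim)
        pos = F.logsigmoid((t * c).sum(-1))            # observed pairs should score high
        neg = F.logsigmoid(-(n @ t.unsqueeze(-1)).squeeze(-1)).sum(-1)  # sampled pairs should score low
        return -(pos + neg).mean()                     # loss to minimise
```

The two embedding tables correspond to the target and context vectors of the model; after training, the target table is typically kept as the word embeddings.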

Level B

In Lecture 1.3 you learned about the CBOW classifier. This classifier is easy to implement in PyTorch, thanks to its automatic differentiation magic; but it is also easy to forget what is going on under the hood. Your task in this lab is to implement the CBOW classifier without any magic, using only a library for vector operations (NumPy).

Lab L1X: Under the hood of the CBOW classifier (due 2023-01-27)
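
As a hint of what "no magic" means here, the sketch below computes the forward pass and hand-derived gradients of a CBOW-style classifier in plain NumPy; all sizes, names, and the two-class setup are assumptions for the example, not the lab's specification.

```python
import numpy as np

# Hypothetical sizes for this sketch (not the lab's actual settings).
vocab_size, embedding_dim, num_classes = 1000, 50, 2
rng = np.random.default_rng(0)
E = rng.normal(scale=0.1, size=(vocab_size, embedding_dim))   # word embeddings
W = rng.normal(scale=0.1, size=(embedding_dim, num_classes))  # output weights
b = np.zeros(num_classes)

def forward(word_ids):
    """Average the embeddings of the input words and apply a softmax layer."""
    h = E[word_ids].mean(axis=0)           # the "continuous bag of words"
    logits = h @ W + b
    exp = np.exp(logits - logits.max())    # numerically stable softmax
    return h, exp / exp.sum()

def gradients(word_ids, label):
    """Hand-derived gradients of the cross-entropy loss w.r.t. all parameters."""
    h, probs = forward(word_ids)
    d_logits = probs.copy()
    d_logits[label] -= 1.0                 # combined softmax + cross-entropy gradient
    dW = np.outer(h, d_logits)
    db = d_logits
    dh = W @ d_logits
    dE_rows = np.tile(dh / len(word_ids), (len(word_ids), 1))  # gradient through the mean
    return dW, db, dE_rows
```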

Lab 2: Language modelling

Language modelling is about building models that predict which words are more or less likely to occur in some language.

Level A

In this lab you will implement and train two neural language models: the fixed-window model from Lecture 2.3 and the recurrent neural network model from Lecture 2.5. You will evaluate these models by computing their perplexity on a benchmark dataset for language modelling, the WikiText dataset.

Lab L2: Language modelling (due 2023-02-03)
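
For the evaluation part, perplexity is the exponential of the average per-token cross-entropy. Below is a sketch, assuming a model that maps input token ids to logits over the vocabulary; the interface and batch format are made up for the example.

```python
import math
import torch
import torch.nn.functional as F

def perplexity(model, batches):
    """Perplexity = exp of the average per-token cross-entropy.

    Assumes `model(inputs)` returns logits of shape (batch, seq_len, vocab_size)
    and that each batch is an (inputs, targets) pair of token-id tensors;
    these conventions are assumptions for the sketch.
    """
    total_loss, total_tokens = 0.0, 0
    model.eval()
    with torch.no_grad():
        for inputs, targets in batches:
            logits = model(inputs)
            loss = F.cross_entropy(
                logits.view(-1, logits.size(-1)),  # (batch * seq_len, vocab)
                targets.view(-1),                  # (batch * seq_len,)
                reduction='sum',
            )
            total_loss += loss.item()
            total_tokens += targets.numel()
    return math.exp(total_loss / total_tokens)
```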

Level C

While the neural models that you have seen in the base lab define the state of the art in language modelling, they require substantial computational resources. Where these are not available, the older generation of probabilistic language models can provide a strong baseline. Your task in this lab is to evaluate one of these models on the WikiText dataset.

Lab L2X: Interpolated n-gram models (due 2023-02-03)
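
As an illustration of the idea behind interpolation, the sketch below combines bigram, unigram, and uniform estimates with fixed weights; the weights and the bigram order are arbitrary choices for the example, and the lab may ask for higher-order models and tuned weights.

```python
from collections import Counter

class InterpolatedBigram:
    """Toy interpolated bigram model; the smoothing weights are made up."""

    def __init__(self, sentences, lambdas=(0.7, 0.25, 0.05)):
        self.l2, self.l1, self.l0 = lambdas
        self.unigrams = Counter()
        self.bigrams = Counter()
        for sentence in sentences:
            self.unigrams.update(sentence)
            self.bigrams.update(zip(sentence, sentence[1:]))
        self.total = sum(self.unigrams.values())
        self.vocab_size = len(self.unigrams)

    def prob(self, prev, word):
        # Interpolate bigram, unigram, and uniform estimates.
        p_bi = self.bigrams[(prev, word)] / self.unigrams[prev] if self.unigrams[prev] else 0.0
        p_uni = self.unigrams[word] / self.total
        p_uniform = 1.0 / self.vocab_size
        return self.l2 * p_bi + self.l1 * p_uni + self.l0 * p_uniform
```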

Lab 3: Large language models

The labs in this unit introduce you to the large language models that power the recent advances in natural language processing. These models are based on the Transformer architecture, whose core is a mechanism called attention.
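
To make the attention mechanism concrete, here is a minimal sketch of scaled dot-product attention, the form used inside the Transformer; the tensor shapes and the optional mask argument are illustrative, not a prescribed interface.

```python
import math
import torch
import torch.nn.functional as F

def scaled_dot_product_attention(queries, keys, values, mask=None):
    """Scaled dot-product attention over a batch of sequences.

    queries: (batch, m, d); keys and values: (batch, n, d).
    """
    scores = queries @ keys.transpose(-2, -1) / math.sqrt(queries.size(-1))
    if mask is not None:
        scores = scores.masked_fill(mask == 0, float('-inf'))
    weights = F.softmax(scores, dim=-1)   # how much each query attends to each key
    return weights @ values, weights
```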

Level A

In this lab you will implement a simple encoder–decoder architecture for machine translation and extend it with an attention mechanism. You will evaluate the architecture on a parallel German–English dataset.

Lab L3: Attention (due 2023-02-10)

Level B

One of the main selling points of pre-trained language models is that they can be applied to a wide spectrum of different tasks in natural language processing. In this lab you will test this by fine-tuning a pre-trained BERT model on a benchmark task in natural language inference.

Lab L3X: BERT for Natural Language Inference (due 2023-02-10)
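
A minimal fine-tuning sketch using the Hugging Face transformers library is shown below; the model name, the three-way label set, and the example sentence pair are assumptions for illustration, and the lab may prescribe different data and training details.

```python
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# Model name and label count are assumptions (three NLI classes:
# entailment, neutral, contradiction).
tokenizer = AutoTokenizer.from_pretrained('bert-base-uncased')
model = AutoModelForSequenceClassification.from_pretrained('bert-base-uncased', num_labels=3)

premise = 'A dog is running in the park.'
hypothesis = 'An animal is outside.'
inputs = tokenizer(premise, hypothesis, return_tensors='pt', truncation=True)
labels = torch.tensor([0])  # hypothetical gold label

outputs = model(**inputs, labels=labels)
outputs.loss.backward()     # in a full training loop, an optimizer step would follow
```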

Lab 4: Sequence labelling

Sequence labelling is the task of assigning a class label to each item in an input sequence. The labs in this unit will focus on the task of part-of-speech tagging.

Level A

In this lab you will implement a simple part-of-speech tagger based on the fixed-window architecture, and evaluate this tagger on the English treebank from the Universal Dependencies Project, a corpus containing more than 16,000 sentences (254,000 tokens) annotated with, among other things, parts of speech.

Lab L4: Part-of-speech tagging (due 2023-02-17)
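
The fixed-window idea is the same one used for language modelling in Lab 2: concatenate the embeddings of the words in a small window around the position to be tagged and feed them to a feed-forward classifier. A minimal sketch, with all sizes made up:

```python
import torch
import torch.nn as nn

class FixedWindowTagger(nn.Module):
    """Minimal fixed-window tagger sketch; sizes and window width are made up."""

    def __init__(self, vocab_size, num_tags, embedding_dim=50, window=3, hidden_dim=100):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embedding_dim)
        self.hidden = nn.Linear(window * embedding_dim, hidden_dim)
        self.output = nn.Linear(hidden_dim, num_tags)

    def forward(self, windows):
        # windows: (batch, window) word ids centred on the position to be tagged
        e = self.embedding(windows)                      # (batch, window, dim)
        e = e.flatten(start_dim=1)                       # concatenate the window embeddings
        return self.output(torch.relu(self.hidden(e)))   # tag scores
```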

Level B

In the advanced part of this lab, you will practice your skills in feature engineering, the task of identifying useful features for a machine learning system – in this case a part-of-speech tagger based on the averaged perceptron.

Lab L4X: Feature engineering for part-of-speech tagging (due 2023-02-17)
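
To illustrate what feature engineering can look like for a perceptron tagger, here is a sketch of some hypothetical feature templates; the templates, names, and boundary conventions are made up for the example, and the lab asks you to design and evaluate your own.

```python
def features(words, i, prev_tag):
    """Hypothetical feature templates for tagging position i."""
    word = words[i]
    return [
        'word=' + word,
        'prev_tag=' + prev_tag,
        'suffix3=' + word[-3:],
        'prev_word=' + (words[i - 1] if i > 0 else '<s>'),
        'is_capitalized=' + str(word[:1].isupper()),
    ]
```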

Lab 5: Syntactic analysis

Syntactic analysis, also called syntactic parsing, is the task of mapping a sentence to a formal representation of its syntactic structure.

Level A

In this lab you will implement a transition-based dependency parser based on the fixed-window neural architecture, and evaluate it on the English Web Treebank from the Universal Dependencies Project.

Lab L5: Dependency parsing (due 2023-02-24)
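
For orientation, the sketch below shows one common transition system (arc-standard) for building dependency trees; the lab's actual transition system, feature extraction, and neural scoring are not shown, and the representation of parser configurations is invented for the example.

```python
# A minimal arc-standard transition system sketch.

SHIFT, LEFT_ARC, RIGHT_ARC = 0, 1, 2

def initial_config(num_words):
    # stack, buffer of word positions, and the heads found so far
    return [], list(range(num_words)), {}

def apply(transition, config):
    stack, buffer, heads = config
    if transition == SHIFT:
        stack.append(buffer.pop(0))
    elif transition == LEFT_ARC:        # second-topmost word gets the topmost as its head
        dependent = stack.pop(-2)
        heads[dependent] = stack[-1]
    elif transition == RIGHT_ARC:       # topmost word gets the second-topmost as its head
        dependent = stack.pop()
        heads[dependent] = stack[-1]
    return stack, buffer, heads
```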

Level C

In this lab you will implement the Eisner algorithm for projective dependency parsing and use it to transform (possibly non-projective) dependency trees to projective trees. This transformation is necessary to be able to apply algorithms for projective dependency parsing to treebanks that may contain non-projective trees.

Lab L5X: Projectivization (due 2023-02-24)
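
As a small illustration of the underlying notion, the function below checks whether a dependency tree is projective, i.e. whether none of its arcs cross; the head-map representation is an assumption for the sketch.

```python
def is_projective(heads):
    """Check whether a dependency tree is projective.

    `heads` maps each word position to the position of its head
    (a convention assumed for this sketch).
    """
    arcs = [(min(h, d), max(h, d)) for d, h in heads.items()]
    for (l1, r1) in arcs:
        for (l2, r2) in arcs:
            # Two arcs cross if exactly one endpoint of one arc lies
            # strictly inside the span of the other.
            if l1 < l2 < r1 < r2:
                return False
    return True
```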