When will I have access to the lectures and assignments?

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

What will I get if I purchase the Certificate?

When you purchase a Certificate you get access to all course materials, including graded assignments. Upon completing the course, your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Is financial aid available?

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Natural Language Processing

4 days left! Save on skills that make you shine with 40% off 3 months of Coursera Plus. Save now

Natural Language Processing

Instructors: MD Husnain

Included with

Learn more

12 modules

Gain insight into a topic and learn the fundamentals.

Intermediate level

Recommended experience

6 weeks to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

12 modules

Gain insight into a topic and learn the fundamentals.

Intermediate level

Recommended experience

6 weeks to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

What you'll learn

Understand and recall core concepts and techniques in Natural Language Processing (NLP).
Analyse and evaluate NLP methods for varied tasks, considering performance, context, and suitability.
Design and develop real-world NLP applications by integrating multiple techniques.

Skills you'll gain

Tools you'll learn

Details to know

Shareable certificate

Add to your LinkedIn profile

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

There are 12 modules in this course

Are you curious about how chatbots hold conversations or how ChatGPT generates human-like responses? This course in Natural Language Processing (NLP) is your gateway into the fascinating world where language meets AI. Designed for students and professionals alike, the course blends essential theory with hands-on experience to equip you with the skills needed to build intelligent language systems.

We start by unravelling what makes language so complex—and why teaching machines to understand it is such a challenging task. You’ll explore the inner workings of Natural Language Understanding (NLU) and Generation (NLG), investigate real-world NLP applications, and dive into current trends like large language models (LLMs) and transformer-based systems. From there, you’ll roll up your sleeves and learn core NLP techniques like tokenization, stemming, lemmatization, and sentence segmentation. You’ll master vector-based approaches like Bag of Words and TF-IDF, then progress to powerful word embeddings like Word2Vec, Skip-gram, and GloVe. As you advance, you'll build language models, train simple neural networks, and explore cutting-edge tools in POS tagging, syntactic parsing, and semantic analysis. You’ll even touch the future with knowledge graphs and Word Sense Disambiguation. By the end, you’ll be ready to innovate in the fast-evolving NLP landscape. Graduates of this NLP course can pursue roles such as NLP Engineer, Machine Learning Engineer, or Data Scientist with a focus on language technologies. Opportunities also exist in AI-driven fields like chatbots, voice assistants, sentiment analysis, and information retrieval. Advanced learners may explore careers in research, LLM fine-tuning, or knowledge graph development. Are you ready to unlock the power of cutting-edge NLP skills? Join us on this exciting journey into the world of language, AI, and intelligent data processing!

Module details

In this module, the learners will be introduced to the course and its syllabus, setting the foundation for their learning journey. The course's introductory video will provide them with insights into the valuable skills and knowledge they can expect to gain throughout the duration of this course. Additionally, the syllabus reading will comprehensively outline essential course components, including course values, assessment criteria, grading system, schedule, details of live sessions, and a recommended reading list that will enhance the learner’s understanding of the course concepts. Moreover, this module offers the learners the opportunity to connect with fellow learners as they participate in a discussion prompt designed to facilitate introductions and exchanges within the course community.

What's included

2 videos1 reading1 discussion prompt

This module introduces the fundamental concepts of Natural Language Processing (NLP). It begins with the definition of NLP and explores a variety of real-world applications. You will gain an understanding of Natural Language Understanding (NLU) and Natural Language Generation (NLG). The module also covers key evaluation metrics used to assess NLP systems. Additionally, a hands-on lab session will guide you through the implementation of basic NLP preprocessing techniques.

What's included

13 videos4 readings12 assignments1 discussion prompt

13 videosTotal 76 minutes

NLP Definition3 minutes
NLP Applications5 minutes
Why NLP is Hard?10 minutes
Natural Language Understanding 4 minutes
Levels of Language Understanding5 minutes
Natural Language Generation4 minutes
Organisation of NLP System6 minutes
Intrinsic vs. Extrinsic Evaluation4 minutes
Challenges in Evaluation4 minutes
NLP Tools Overview7 minutes
Demo of NLP Tools6 minutes
Basic NLP Application Development Using NLP Tools13 minutes
Module Wrap-Up6 minutes

4 readingsTotal 60 minutes

Recommended Reading: What is NLP?15 minutes
Recommended Reading: NLP Fundamentals15 minutes
Recommended Reading: Evaluation of NLP Systems15 minutes
Recommended Reading: NLP Tools Introduction15 minutes

12 assignmentsTotal 45 minutes

NLP Definition6 minutes
NLP Applications3 minutes
Why NLP is a Hard Problem3 minutes
Natural Language Understanding 3 minutes
Levels of Language Understanding3 minutes
Natural Language Generation3 minutes
Organisation of NLP System3 minutes
Intrinsic vs. Extrinsic Evaluation6 minutes
Challenges in Evaluation3 minutes
NLP Tools Overview6 minutes
Demo of NLP Tools3 minutes
Basic NLP Application Development Using NLP Tools3 minutes

1 discussion promptTotal 30 minutes

Real-World Challenges and Tools in Natural Language Processing30 minutes

This module introduces essential NLP preprocessing techniques. It begins with regular expressions for text pattern matching, followed by an overview of words and corpora as foundational data sources. Sentence segmentation and tokenization are then covered through practical demonstrations. Finally, the module explores normalization, lemmatization, and stemming as methods to standardise text, with a demo highlighting their differences and effects.

What's included

14 videos5 readings14 assignments1 discussion prompt

14 videosTotal 79 minutes

Regular Expressions8 minutes
Words and Corpora5 minutes
Sentence Segmentation3 minutes
Code Demo Segmentation5 minutes
Tokenization5 minutes
Tokenization Methods7 minutes
Code Demo Tokenization14 minutes
Normalization 4 minutes
Code Demo Normalization 4 minutes
Stemming6 minutes
Code Demo Stemming5 minutes
Lemmatization 3 minutes
Code Demo Lemmatization6 minutes
Module Wrap-Up4 minutes

5 readingsTotal 130 minutes

Recommended Reading: Basic Text Preprocessing35 minutes
Recommended Reading: Segmentation and Tokenization 30 minutes
Recommended Reading: Normalization20 minutes
Recommended Reading: Stemming and Lemmatization30 minutes
Instructional Document: Staff-Graded Assignment-115 minutes

14 assignmentsTotal 99 minutes

Regular Expressions3 minutes
Words and Corpora3 minutes
Sentence Segmentation3 minutes
Code Demo Segmentation3 minutes
Tokenization3 minutes
Tokenization Methods3 minutes
Code Demo Tokenization3 minutes
Normalization 3 minutes
Code Demo Normalization3 minutes
Stemming3 minutes
Code Demo Stemming3 minutes
Lemmatization3 minutes
Code Demo Lemmatization3 minutes
Graded Quiz: Modules 1 and 260 minutes

1 discussion promptTotal 30 minutes

Building a Preprocessing Pipeline: Challenges and Solutions30 minutes

This module explores lexical and vector semantics, focusing on computational representations of word meaning. It covers word vectors, Bag of Words, and co-occurrence matrices to capture contextual relationships. Techniques such as TF-IDF are introduced to measure word importance, along with methods for computing word similarity. Practical examples and mathematical exercises on TF-IDF help reinforce these core NLP concepts.

What's included

13 videos3 readings10 assignments1 discussion prompt

13 videosTotal 72 minutes

Lexical Semantics 3 minutes
Why Vectors?7 minutes
Word and Vectors8 minutes
Bag of Words4 minutes
Computing Word Similarity3 minutes
Cosine Similarity4 minutes
Cosine Similarity Example7 minutes
Term Frequency4 minutes
Inverse Document Frequency11 minutes
TF-IDF7 minutes
Demo of Words as Vectors4 minutes
Demo of TF-IDF8 minutes
Module Wrap-Up4 minutes

3 readingsTotal 45 minutes

Recommended Reading: Foundations of Lexical and Vector Semantics 15 minutes
Recommended Reading: Representing Text Using Vectors 15 minutes
Recommended Reading: Term and Inverse Document Frequency 15 minutes

10 assignmentsTotal 30 minutes

Lexical Semantics 3 minutes
Why Vectors? 3 minutes
Word and Vectors 3 minutes
Bag of Words3 minutes
Computing Word Similarity 3 minutes
Cosine Similarity 3 minutes
Cosine Similarity Example 3 minutes
Term Frequency 3 minutes
Inverse Document Frequency 3 minutes
TF-IDF 3 minutes

1 discussion promptTotal 20 minutes

Applying Vector Semantics in a Real-World Scenario20 minutes

This module introduces Word Embeddings, focusing on the transition from sparse to dense vector representations of words. It covers Word2Vec models, including Skip-gram and CBOW, explained with simple, intuitive examples. The module also explores GloVe embeddings, which capture global word co-occurrence statistics for improved semantic understanding. Learners will visualise word embeddings to gain insights into how words relate in vector space. Finally, the module highlights real-world applications of word embeddings in NLP tasks like sentiment analysis, machine translation, and question answering.

What's included

13 videos3 readings14 assignments1 discussion prompt

13 videosTotal 79 minutes

Word2Vec 4 minutes
Basic 1-Hot Word Representation4 minutes
Feature Based Word Representations3 minutes
Skip Gram Algorithm Introduction6 minutes
Skip Gram Probabilities8 minutes
Skip-Gram Negative Sampling (SGNS) Approach7 minutes
Skip-Gram Negative Training Data Example7 minutes
SGNS Log Loss Function7 minutes
Derivative of SGNS Loss Function6 minutes
SGNS Example Part 112 minutes
SGNS Example Part 28 minutes
Continuous Bag of Words (CBOW)5 minutes
Module Wrap Up 4 minutes

3 readingsTotal 45 minutes

Recommended Reading: Basics of Word2Vec 15 minutes
Recommended Reading: Skip-Gram Word Embedding 15 minutes
Other Word2Vec Approaches Title: Essential Reading Material – CBOW and GloVe 15 minutes

14 assignmentsTotal 396 minutes

Word2Vec3 minutes
Basic 1-Hot Word Representation3 minutes
Feature Based Word Representations3 minutes
Skip Gram Algorithm Introduction3 minutes
Skip Gram Probabilities3 minutes
Skip-Gram Negative Sampling (SGNS) Approach3 minutes
Skip-Gram Negative Training Data Example3 minutes
SGNS Log Loss Function3 minutes
Derivative of SGNS Loss Function3 minutes
SGNS Example Part 13 minutes
SGNS Example Part 23 minutes
Continuous Bag of Words (CBOW)3 minutes
Graded Quiz - Modules 3 and 460 minutes
SGA-1 Submission: Word Embedding300 minutes

1 discussion promptTotal 20 minutes

The Power of Dense Vectors: Choosing an Embedding Model20 minutes

This module introduces Language Modeling (LM) and its role in predicting word sequences in natural language. It explores practical applications of LMs and explains N-gram models, including challenges like generalization and handling zero probabilities. Techniques such as smoothing and stupid backoff are covered to improve model robustness. The module concludes with methods for evaluating language models using standard metrics.

What's included

15 videos4 readings13 assignments1 discussion prompt

15 videosTotal 96 minutes

What is Language Modeling?3 minutes
Language Modelling Applications 3 minutes
How to Build a Language Model 5 minutes
Markov Assumption 2 minutes
N-gram Language Models4 minutes
Bi-gram Computation10 minutes
Raw Probabilities10 minutes
Perils of Overfitting3 minutes
Laplace Smoothing14 minutes
Interpolation & Backoff10 minutes
How Good is the Model?3 minutes
Extrinsic Evaluation5 minutes
Perplexity & It's Example9 minutes
Module Demo10 minutes
Module Wrap-Up5 minutes

4 readingsTotal 60 minutes

Recommended Reading: Language Modelling Introduction15 minutes
Recommended Reading: N-grams 15 minutes
Recommended Reading: Smoothing 15 minutes
Recommended Reading: Language Modelling Evaluation 15 minutes

13 assignmentsTotal 39 minutes

What is Language Modeling? 3 minutes
Language Modelling Applications 3 minutes
How to Build a Language Model 3 minutes
Markov Assumption3 minutes
N-gram Language Models 3 minutes
Bi-gram Computation 3 minutes
Raw Probabilities 3 minutes
Perils of Overfitting 3 minutes
Laplace Smoothing3 minutes
Interpolation & Backoff3 minutes
How Good is the Model?3 minutes
Extrinsic Evaluation 3 minutes
Perplexity & its Example3 minutes

1 discussion promptTotal 20 minutes

Balancing Simplicity and Performance in Language Modelling20 minutes

This module explores the use of Neural Networks in Language Modelling, starting with the fundamentals of Feed-Forward Neural Networks and their training process for language tasks. It introduces Neural Language Models, which capture complex patterns in text beyond traditional statistical methods. The module also provides a foundational understanding of Large Language Models (LLMs) and their capabilities. Finally, it introduces Prompt Engineering as a technique to effectively interact with and guide LLMs for various NLP applications.

What's included

17 videos6 readings16 assignments1 discussion prompt

17 videosTotal 98 minutes

Neural Network Unit3 minutes
Non-Linear Activation Functions5 minutes
Perceptron with Examples4 minutes
Multi-Layer Perceptron8 minutes
Softmax Function with Example4 minutes
Feed Connected Neural Network4 minutes
Feedforward Network5 minutes
Forward Algorithm4 minutes
Backpropagation Algorithm5 minutes
Training Neural Network12 minutes
Neural Language Modeling6 minutes
Training Neural Language Model9 minutes
N-gram Versus Neural Language Model4 minutes
Neural LM Demo10 minutes
What is LLM?6 minutes
LLM Use Cases5 minutes
Module Wrap Up3 minutes

6 readingsTotal 105 minutes

Recommended Reading: Introduction to Neural Network15 minutes
Recommended Reading: Feed Forward Neural Network 15 minutes
Recommended Reading: Training Neural Network 15 minutes
Recommended Reading: Neural Language Models 15 minutes
Recommended Reading: Introduction to Large Language Models 30 minutes
Instructional Document: Staff-Graded Assignment-215 minutes

16 assignmentsTotal 105 minutes

Neural Network Unit3 minutes
Non-Linear Activation Functions3 minutes
Perceptron with Examples3 minutes
Multi-Layer Perceptron3 minutes
Softmax Function with Example3 minutes
Feed Connected Neural Network3 minutes
Feed Forward Network3 minutes
Forward Algorithm3 minutes
Backpropagation Algorithm3 minutes
Training Neural Network3 minutes
Neural Language Modeling3 minutes
Training Neural Language Model3 minutes
N-gram Versus Neural Language Model3 minutes
What is LLM?3 minutes
LLM Use Cases3 minutes
Graded Quiz - Modules 5 and 660 minutes

1 discussion promptTotal 20 minutes

The Next Generation of Language Modelling: From N-grams to LLMs20 minutes

This module provides an introduction to Part-of-Speech (POS) Tagging, techniques to perform POS Tagging and their applications in NLP. POS tagging is a fundamental task in Natural Language Processing (NLP) that involves assigning grammatical categories (like noun, verb, adjective) to words in text. Starting from basic linguistic foundations and real-world applications, the module dives into the evolution of POS tagging techniques—from statistical models like Hidden Markov Models (HMMs) and Maximum Entropy classifiers, to modern deep learning approaches using Recurrent Neural Networks (RNNs). Learners will gain a strong theoretical understanding and insight into how POS tagging supports downstream tasks like parsing, named entity recognition, and machine translation. The module includes a hands-on coding demonstration for POS tagging.

What's included

13 videos5 readings11 assignments1 discussion prompt

13 videosTotal 74 minutes

Outline of the Module 2 minutes
What is POS Tagging? 6 minutes
Challenges in POS Tagging4 minutes
POS Tagsets 6 minutes
Markov Chain5 minutes
Hidden Markov Model5 minutes
Hidden Markov Model as POS Tagger 6 minutes
Viterbi Algorithm 8 minutes
Viterbi Algorithm - Example8 minutes
Logistic Regression - Overview9 minutes
Multinomial Logistic Regression - Overview6 minutes
Maximum Entropy Markov Models (MEMM)7 minutes
Module Wrap Up2 minutes

5 readingsTotal 110 minutes

Code Document: POS tagging using NLTK / spaCy 10 minutes
Recommended Reading: Introduction to POS Tagging and Applications 30 minutes
Code Document: Demonstrating HMM Based POS Tagger10 minutes
Recommended Reading: HMM for POS Tagging 30 minutes
Recommended Reading: Maximum Entropy Markov Models30 minutes

11 assignmentsTotal 33 minutes

What is POS Tagging?3 minutes
Challenges in POS Tagging3 minutes
POS Tagsets 3 minutes
Markov Chain3 minutes
Hidden Markov Model3 minutes
Hidden Markov Model as POS Tagger 3 minutes
Viterbi Algorithm 3 minutes
Viterbi Algorithm - Example3 minutes
Logistic Regression - Overview3 minutes
Multinomial Logistic Regression - Overview3 minutes
Maximum Entropy Markov Models (MEMM)3 minutes

1 discussion promptTotal 30 minutes

POS Tagging: The Right Tool for the Job30 minutes

This module introduces students to the syntactic structure of natural language and its critical role in Natural Language Processing (NLP) applications. Parsing is the task of assigning a structured representation—typically a tree—to a sentence, revealing the grammatical relationships between its components. The module begins by revisiting Context-Free Grammars (CFGs) and how they form the foundation for syntactic parsing. We explore Constituent Parsing, introducing classical parsing techniques such as the CKY (Cocke-Kasami-Younger) algorithm. The module then transitions to modern span-based neural parsing approaches that use neural networks to score and predict parse trees. A significant portion of the module is dedicated to Dependency Parsing, where syntactic structure is represented through direct relationships between words rather than phrases. Students will study both transition-based and graph-based dependency parsers, gaining insight into their strengths, algorithmic designs, and practical performance. Throughout the module, we emphasise real-world NLP applications.

What's included

18 videos4 readings18 assignments1 discussion prompt

18 videosTotal 88 minutes

Outline of the Module 2 minutes
Introduction to Context-Free Grammars (CFGs)8 minutes
Constituency and Phrase Structure5 minutes
Ambiguity in Grammar4 minutes
Chomsky Normal Form (CNF) and Grammar Normalisation5 minutes
Treebanks and Empirical Grammar3 minutes
CKY Algorithm7 minutes
CKY Algorithm - Walkthrough8 minutes
Parse Tree Recovery From CKY Table5 minutes
Neural Span-based Constituency Parsing5 minutes
What is Dependency Parsing?5 minutes
Dependency Formalism5 minutes
Universal Dependency Relations4 minutes
Transition-Based Dependency Parsing 6 minutes
Transition-Based Dependency Parsing - Walkthrough5 minutes
Creating an Oracle 4 minutes
Graph-Based Dependency Parsing5 minutes
Module Wrap Up2 minutes

4 readingsTotal 120 minutes

Recommended Reading: Review of Context-Free Grammars and Parsing in NLP 30 minutes
Recommended Reading: Constituency Parsing and CKY Algorithm 30 minutes
Recommended Reading: Dependency Parsing – Theory and Representations 30 minutes
Recommended Reading: Dependency Parsing Algorithms and Modern Applications 30 minutes

18 assignmentsTotal 411 minutes

Introduction to Context-Free Grammars (CFGs)3 minutes
Constituency and Phrase Structure3 minutes
Ambiguity in Grammar3 minutes
Chomsky Normal Form (CNF) and Grammar Normalisation3 minutes
Treebanks and Empirical Grammar3 minutes
CKY Algorithm3 minutes
CKY Algorithm - Walkthrough3 minutes
Parse Tree Recovery From CKY Table3 minutes
Neural Span-based Constituency Parsing3 minutes
What is Dependency Parsing?3 minutes
Dependency Formalism3 minutes
Universal Dependency Relations3 minutes
Transition-Based Dependency Parsing 3 minutes
Transition-Based Dependency Parsing - Walkthrough6 minutes
Creating an Oracle3 minutes
Graph-Based Dependency Parsing3 minutes
Graded Quiz: Modules 7 and 860 minutes
SGA-2: POS Tagging and Parsing300 minutes

1 discussion promptTotal 30 minutes

Parsing Frameworks: Constituent vs. Dependency30 minutes

This module explores the semantic dimension of natural language by covering both lexical semantics—including word senses, ambiguity, and disambiguation techniques—and the semantic web—a framework for enabling machine-readable, structured understanding of web data. The module starts with foundational concepts in lexical semantics and WordNet, then proceeds to classical and modern word sense disambiguation (WSD) methods. The second part focuses on Semantic Web technologies, covering ontologies, knowledge graphs, RDF/OWL, and their role in enabling intelligent systems and knowledge-driven NLP applications.

What's included

17 videos5 readings14 assignments1 discussion prompt

17 videosTotal 85 minutes

Outline of the Module1 minute
What is a Word Sense?3 minutes
Homonymy vs Polysemy7 minutes
Sense Relations7 minutes
Introduction to WordNet and Synsets7 minutes
Relations in WordNet5 minutes
Navigating WordNet Hierarchies and Graph Structures5 minutes
What is Word Sense Disambiguation? 4 minutes
Supervised WSD8 minutes
Knowledge-Based WSD: Lesk Algorithm5 minutes
From Syntactic Web to Semantic Web: What's the Problem?6 minutes
Semantic Web Vision: Data Integration and Automation3 minutes
Ontologies4 minutes
Ontology Languages and Their Layers9 minutes
What is a Knowledge Graph? 3 minutes
Applications in NLP6 minutes
Module Wrap Up1 minute

5 readingsTotal 130 minutes

Recommended Reading: Word Senses and Lexical Semantics30 minutes
Code Document: Querying WordNet in Python (using nltk.corpus.wordnet)10 minutes
Recommended Reading: WordNet and Semantic Lexicons30 minutes
Recommended Reading: Word Sense Disambiguation (WSD)30 minutes
Recommended Reading: Introduction to the Semantic Web and Ontologies30 minutes

14 assignmentsTotal 42 minutes

What is a Word Sense? 3 minutes
Homonymy vs Polysemy3 minutes
Sense Relations3 minutes
Introduction to WordNet and Synsets3 minutes
Relations in WordNet3 minutes
Navigating WordNet Hierarchies and Graph Structures3 minutes
What is Word Sense Disambiguation?3 minutes
Supervised WSD3 minutes
Knowledge-Based WSD: Lesk Algorithm3 minutes
Semantic Web Vision: Data Integration and Automation3 minutes
Ontologies3 minutes
Ontology Languages and Their Layers3 minutes
What is a Knowledge Graph? 3 minutes
Applications in NLP3 minutes

1 discussion promptTotal 30 minutes

Disambiguating the Future: WSD and the Semantic Web30 minutes

This module introduces students to the evolution of neural network architectures in NLP, beginning with recurrent models (RNNs), progressing through attention mechanisms, and culminating in Transformer-based models that have revolutionised natural language processing. Through hands-on coding and application-driven lessons, students will explore how Transformers power state-of-the-art systems in sentiment analysis (text classification), machine translation, and question answering. The module emphasises both theoretical foundations and practical implementation using modern deep learning frameworks.