An Odyssey through MSc Waters

  • 0 Rating
  • 0 Reviews
  • 2 Students Enrolled

This courselet offers an in-depth analysis of MSc theses from the LvB Chair of Statistics at HU Berlin. Employing Latent Dirichlet Allocation, we have identified and explored different topics within the theses. We invite you to explore the results of our investigation.

  • Free

Courselet Content

2 components

Requirements

It is recommended to have a general understanding of:

  1. Web scraping
  2. Text preprocessing: cleaning data and building a corpus
  3. Dimensionality reduction methods, especially UMAP
  4. Hyperparameter tuning: grid search
  5. Topic modeling approaches, especially LDA

General Overview

Description

This courselet covers:

  1. Topic Modeling Approaches: An overview of LSA, PLSA, and LDA.
  2. Web scraping: Collecting MSc theses from the HU Berlin website.
  3. Text Cleaning: Removing noise, stopwords, and rare words from the data.
  4. Corpus Creation: Reorganizing data into a suitable format.
  5. LDA Application: Employing LDA with grid search for topic exploration.
  6. UMAP Visualization: Uncovering visual patterns in text data through UMAP.
  7. Dynamic Topic Modeling: Exploring topic evolution over time using DTM.
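
To give a feel for the text cleaning and corpus creation steps listed above, here is a minimal sketch in plain Python. It is not the courselet's actual pipeline: the stopword list, the `tokenize` and `build_corpus` helpers, the `min_doc_freq` threshold, and the three sample abstracts are all assumptions made up for illustration. The output is a vocabulary plus a bag-of-words corpus, the kind of input an LDA implementation typically expects.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list; a real project would use a fuller one.
STOPWORDS = {"the", "a", "an", "of", "and", "in", "to", "for", "with", "on", "is"}


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens of at least three letters."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if len(t) >= 3]


def build_corpus(documents, min_doc_freq=2):
    """Return (vocabulary, bag-of-words corpus) after dropping stopwords and rare words."""
    tokenized = [[t for t in tokenize(d) if t not in STOPWORDS] for d in documents]
    # Document frequency: the number of documents in which each token occurs.
    doc_freq = Counter(t for doc in tokenized for t in set(doc))
    # Rare words (below the document-frequency threshold) are excluded from the vocabulary.
    vocab = sorted(t for t, n in doc_freq.items() if n >= min_doc_freq)
    index = {t: i for i, t in enumerate(vocab)}
    corpus = []
    for doc in tokenized:
        counts = Counter(index[t] for t in doc if t in index)
        # Each document becomes a sorted list of (word_id, count) pairs.
        corpus.append(sorted(counts.items()))
    return vocab, corpus


# Invented sample abstracts, not taken from the actual theses.
abstracts = [
    "Forecasting volatility of crypto currencies with GARCH models",
    "Topic models for the analysis of crypto markets",
    "Volatility forecasting in energy markets",
]
vocab, corpus = build_corpus(abstracts)
print(vocab)
print(corpus)
```

The document-frequency threshold is one simple way to define "rare words". On a real collection of theses it would likely need tuning alongside the LDA hyperparameters.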

Meet the instructors!

About the Instructor

Hello! I am a member of the LDA MSc Theses team in the DEDA class. We want to upload our final slides.