An Odyssey through MSc Waters

  • 0 Rating
  • 0 Reviews
  • 2 Students Enrolled

This courselet offers an in-depth analysis of MSc theses from the LvB Chair of Statistics at HU Berlin. Employing Latent Dirichlet Allocation, we have identified and explored different topics within the theses. We invite you to explore the results of our investigation.

  • Free
Courselet Content

2 components

Requirements

A general understanding of the following topics is recommended:

  1. Web scraping
  2. Text preprocessing: cleaning data and building a corpus
  3. Dimensionality reduction methods, especially UMAP
  4. Hyperparameter tuning: grid search
  5. Topic modeling approaches, especially LDA

General Overview

Description

This courselet covers:

  1. Topic Modeling Approaches: An overview of LSA, PLSA, and LDA.
  2. Web scraping: Collecting MSc theses from the HU Berlin website.
  3. Text Cleaning: Removing noise, stopwords, and rare words from the data.
  4. Corpus Creation: Reorganizing the cleaned data into a format suitable for topic modeling.
  5. LDA Application: Employing LDA with grid search for topic exploration.
  6. UMAP Visualization: Uncovering visual patterns in text data through UMAP.
  7. Dynamic Topic Modeling: Exploring topic evolution over time using DTM.
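
As a rough illustration of the text cleaning and corpus creation steps listed above, the following is a minimal sketch using only the Python standard library. It is not the courselet's actual code: the stopword list, the function names `tokenize` and `build_corpus`, the `min_doc_freq` threshold, and the sample abstracts are all hypothetical and chosen only for demonstration.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list (an assumption for this sketch);
# a real project would use a fuller list.
STOPWORDS = {"the", "a", "an", "of", "and", "in", "to", "for", "with", "on", "is"}


def tokenize(text):
    """Lowercase the text, keep alphabetic tokens, drop stopwords and very short tokens."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [t for t in tokens if t not in STOPWORDS and len(t) > 2]


def build_corpus(documents, min_doc_freq=2):
    """Return (vocabulary, bag-of-words corpus), dropping words that occur in too few documents."""
    tokenized = [tokenize(doc) for doc in documents]
    # Document frequency: the number of documents in which each token appears.
    doc_freq = Counter(tok for doc in tokenized for tok in set(doc))
    vocab = sorted(t for t, df in doc_freq.items() if df >= min_doc_freq)
    token_to_id = {t: i for i, t in enumerate(vocab)}
    corpus = []
    for doc in tokenized:
        counts = Counter(token_to_id[t] for t in doc if t in token_to_id)
        # Each document becomes a sorted list of (word_id, count) pairs.
        corpus.append(sorted(counts.items()))
    return vocab, corpus


# Made-up example abstracts, not taken from the actual theses.
abstracts = [
    "Volatility forecasting with GARCH models for the stock market",
    "Topic models for text: an analysis of the market for news",
    "Forecasting volatility in the cryptocurrency market",
]
vocab, corpus = build_corpus(abstracts)
print(vocab)
print(corpus)
```

The resulting list of `(word_id, count)` pairs per document is the bag-of-words layout that LDA implementations commonly expect as input; the rare-word threshold would normally be tuned to the size of the collection.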

Meet the instructors!

About the Instructor

Hello! I am a member of the LDA MSc Theses team in the DEDA class. We want to upload our final slides.