Explanatory Data Analysis and Data Preparation - Part 2

  • 0 Rating
  • 0 Reviews
  • 6 Students Enrolled

Explanatory Data Analysis and Data Preparation - Part 2

Introduction to the foundations of Data Science with focus on business applications. We emphasize supervised machine learning algorithms to build predictive decision support models for credit risk, marketing analytics, and several other use cases.

  • 0 Rating
  • 0 Reviews
  • 6 Students Enrolled
  • Free
Tags:



Courselet Content

1 components

Requirements

  • - no specific requirements - working knowledge of multivariate statistics is useful - prior experiences with computer programming are helpful but not mandatory

General Overview

Description

"Data is the new oil..."

You might have heard this saying or a similar phrase before. Big Data, Analytics, Data Science, Artificial Intelligence, Machine Learning, ... many colorful terms refer to the increasing use of analytical models that aim at extracting insight from the vast amounts of data that the digital society is producing.

The module Business Analytics and Data Science (BADS) is concerned with theories, concepts, and practices to support decision-making by means of formal, data-driven methods. We will revisit different forms of model-based decision support, examine the standard workflow of modern data analysis, and discuss a broad set of models for descriptive and predictive analytics. Predictive analytics is the main focus of the course. Many corporate use cases of analytics and data science involve predicting some future state or behavior, for example, how customers will respond to certain marketing stimuli. We will introduce statistical principles of learning from data and cover several common prediction methods, ranging from established industry workhorses like logistic regression to state-of-the-art machine learning algorithms such as gradient boosting. Subsequently, we will dive into specific tasks in the predictive modeling pipeline such as e.g., feature selection or remedies to the class imbalance problem. Given a variety of specialized modeling tasks and challenges, we will focus on topics with high relevance to managerial decision-making including cost-sensitive learning and model explainability (i.e., XAI).

The module consists of a lecture and a tutorial session. The lecture introduces relevant concepts and provides room for discussion. The goal of the tutorial is to empower students to develop state-of-the-art analytical models using contemporary programming libraries for data science. Specifically, we will use the Python programming language. Students receive demos on how to implement specific algorithms from scratch and work with real-world data to solve common modeling tasks themselves.

In summary, the module pursues the following learning objectives:

  • Students are familiar with the three branches of descriptive, predictive, and prescriptive analytics and appreciate the relationships between these streams.
  • Given some data, students are able to select appropriate techniques to summarize and visualize the data to maximize managerial insight.
  • Students understand the potential and also limitations of predictive analytics to aid decision-making. Given a decision task, they can discuss the relative merits and demerits of alternative algorithms and recommend a suitable prediction method.
  • Students are familiar with Python programming and standard Python libraries for data handling and machine learning. Using these tools, they can develop basic and advanced prediction models and assess their accuracy in a statistically sound manner.

It is not strictly necessary that students join the course with prior experience in computer programming. We reserve the first two weeks of the tutorial to introduce programming principles and the Python programming language. That said, high and continuous engagement with the module in general and the tutorial in particular including ample time for self-study is expected to ensure the completion of our ambitious learning program. Students who wish to prepare for the course are invited to complete some of the many excellent tutorials on Python programming. A simple web search for "Python programming introduction" produces tons of results. The resources at Python.org also provide an excellent starting point.

We are looking forward to seeing you in BADS.

Courses that include this CL

blog
Last Updated 19th March 2024
  • 196
  • Free

Meet the instructors !

instructor
About the Instructor

Stefan received a PhD from the University of Hamburg in 2007, where he also completed his habilitation on decision analysis and support using ensemble forecasting models in 2012. He then joined the Humboldt-University of Berlin in 2014, where he heads the Chair of Information Systems at the School of Business and Economics. He serves as an associate editor for the International Journal of Business Analytics, Digital Finance, and the International Journal of Forecasting, and as department editor of Business and Information System Engineering (BISE). Stefan has secured substantial amounts of research funding and published several papers in leading international journals and conferences. His research concerns the support of managerial decision-making using quantitative empirical methods. He specializes in applications of (deep) machine learning techniques in the broad scope of marketing and risk analytics. Stefan actively participates in knowledge transfer and consulting projects with industry partners; from start-up companies to global players and not-for-profit organizations.