Python Basics for Data Analysis (DSC-2023-05) | 1,5 days


26.09. - 29.09.2023


26 Sep.: 08:15 - 11:45AM
27. Sep.: no training
28 Sep.: 08:15 - 11:45AM
29 Sep.: 08:15 - 11:45AM

Workshop for PhD students and Postdocs


Speaker:
Dr. Matthias Assenmacher


Location:
Online (Zoom)

The workshop will be held in
English.

The workshop is already fully booked.

Also, if you have any questions regarding our workshops, please feel free to write us an E-MAIL.






« Back

OBJECTIVES

The Python Basics course is intended for participants who want to learn basic Python skills as well as efficient handling of data preparation, data processing and data analysis in Python. In addition, general “best practices” in Python will be taught, including, writing simple, readable and modularly extensible code. All topics presented will be explained, demonstrated, and practiced in detail with the help of participant exercises under intensive instruction.

WORKSHOP CONTENT

  • Introduction to the basics of Python
  • Installation and use of Python and useful Python modules
  • Creating and working with virtual environments
  • Explanation of the most important data types, operators, functions and help pages
  • Introduction to NumPy and Pandas
  • Importing and exporting data
  • Working with DataFrames and vectors (numeric, logical, character, factors), e.g. indexing, splitting and transforming variables or data sets
  • Calculating statistical ratios (e.g.: mean, quantiles, variance, etc.)
  • Review of Python basics: built-in structures, numpy, IPython, jupyter notebook, package management, jupytext
  • Series and DataFrames: generation, meaning of line index, filtering, pointer vs. copy
  • Data cleansing: Handling missing values, editing strings, removing duplicates
  • Transforming data by vectorized operations like map or apply
  • Merging different data sources and creating a "good" table structure of the data
  • Grouping of data and aggregations: Split-Apply-Combine

TARGET AUDIENCE

This course is suitable for participants with no knowledge of Python or to refresh the basics in Python. The workshop is very hands-on and thus limited to max. 15 participants.

TECHNICAL REQUIREMENTS

Use a laptop/PC with reliable internet access and install the following software:



ABOUT THE TRAINER

Dr. Matthias Assenmacher works as trainer for Data Science Essential GmbH and is postdoctoral researcher at the Chair of Statistical Learning and Data Science (LMU) and the NFDI Consortium for Business, Economic and Related Data (BERD@NFDI). He obtained his bachelor’s degree in Economics from LMU in 2014, afterwards I turned to Statistics (with a focus on social and economic studies) and obtained his Master’s degree in 2017 (also from LMU). In October 2021 he finished his PhD with a focus on Natural Language Processing.

His expertise revolves around the practical application of state-of-the-art NLP architectures to real-world problems from various disciplines, as well as open and reproducible science.