• Genome Data Science

    We develop methods and tools to work with tens of thousands of genomes and analyze and integrate the corresponding data.

    Model of DNA double helix in front of a student.
    © Universität Bielefeld

Programming


392168/392169 Schönhuth, Pianesi Winter 2024/25 Wed 14:00-16:00 H12 & Zoom (Q&A) and Thu 10:00-12:00 C01-220 & Zoom (Ü)

Contents

Data Science is an emerging interdisciplinary field with the aim to extract information from prevalently unstructured data. A basic skill for every data scientist is programming.

This course sets out to introduce Python, a modern object-oriented programming language, to prospective data scientists. The class covers basic programming skills and provides an introduction to computer science. In the second part, Python libraries and tools are presented that are handy in the daily life of a data scientist, such as Jupyter Notebook, NumPy, Pandas, Matplotlib, Scikit-Learn, and Pyspark.

No prior knowledge of computer science is required, but basic training in mathematics is assumed.


This class will be taught on site and online via Zoom.
Tutorials are offered on site and online via Zoom.

Literature

Contact

Time table lecture

Date Topic Discussion Exercise Upload
09.10.2024 Organizational matters, intro to programming and ChatGPT (slides) Exercise 01 (file)
16.10.2024 Programming and Python basics (slides) Exercise 01 (file)
23.10.2024 Data types, arithmetic operations & Conditions, comparisons (slides) Exercise 02 (file)
30.10.2024 Loops (slides) ONLINE ONLY Exercise 03 (file)
06.11.2024 Functions, debugging & Functional programming, lazy evaluation Exercise 04
13.11.2024 Object oriented Programming Exercise 05
20.11.2024 NO LECTURE Exercise 06
27.11.2024 Input, processing of files and Text Mining Exercise 07
04.12.2024 Data visualization and NumPy Exercise 08
11.12.2024 Pandas Exercise 09
18.12.2024 Machine Learning Exercise 10
25.12.2024 Christmas Break
01.01.2025 Christmas Break
08.01.2025 Databases and distributed computing Exercise 11
15.01.2025 Advanced topic 1 - TBD
22.01.2025
29.01.2025

Time table tutorial

Date Exercise Discussion
10.10.2024 Intro from ground up
17.10.2024 Exercise 00, ChatGPT & Scratch
24.10.2024 Exercise 01, Python basics
31.10.2024 Exercise 02, Data types & more ONLINE ONLY
07.11.2024 Exercise 03, Loops
14.11.2024 Exercise 04, Functions & more
21.11.2024 NO LECTURE
28.11.2024 Exercise 05, OOP
05.12.2024 Exercise 06, I/O
12.12.2024 Exercise 07, NumPy
19.12.2024 Exercise 08, Pandas
26.12.2024 Christmas Break
02.01.2025 Christmas Break
09.01.2025 Exercise 09, ML
16.01.2025 Exercise 10, Distr. computing
23.01.2025 Free Q&A