The lecture Big Data Analytics develops competencies in performing data mining tasks on very large amounts of data that cannot be stored in main memory. The lecture provides the key ideas of similarity search using minhashing and locality-sensitive hashing, of data stream processing where data arrives so fast that it has to be processed immediately or is otherwise lost, of Web-related algorithms such as Google's PageRank, of algorithms for mining frequent itemsets, association rules and frequent subgraphs, of algorithms to analyze the structure of large graphs such as social network graphs, and of the map-reduce principle to design parallel algorithms.
This class will be taught online, through video lectures.
Date | Topic |
15.04.2021 | Introduction / Organization / Schedules ( slides) |
22.04.2021 | Finding Similar Items I ( slides) |
29.04.2021 | Finding Similar Items II ( slides) |
06.05.2021 | Map Reduce / Workflow Systems I ( slides) |
13.05.2021 | no lecture |
20.05.2021 | Map Reduce / Workflow Systems II ( slides) |
27.05.2021 | Map Reduce / Workflow Systems III & Mining Data Streams I ( slides) |
03.06.2021 | no lecture |
10.06.2021 | Mining Data Streams II / Link Analysis I ( slides) |
17.06.2021 | Link Analysis II ( slides) |
24.06.2021 | Mining Frequent Itemsets I ( slides) |
01.07.2021 | Recommendation Systems ( slides) |
08.07.2020 | Social Networks I ( slides) |
15.07.2020 | Mining Data Streams III / Q&A |
22.07.2020 | Mining Frequent Itemsets II ( slides) |
1st Exam: Written exam on Thursday, July 29, 2021 12:00-14:00 in Lockschuppen Bielefeld (location may change)
2nd Exam: Online exam on Tuesday, September 28, 2021 (time tbd)