====== Big Data Analytics ====== \\ | [[https://ekvv.uni-bielefeld.de/kvv_publ/publ/vd?id=394209201 | 392157]]/[[https://ekvv.uni-bielefeld.de/kvv_publ/publ/vd?id=394663185 | 392158]] | Schönhuth | Summer 2023 | Thu 10-12 & Wed 16-18 (V) | ==== Contents ==== The lecture Big Data Analytics develops competencies in performing data mining tasks on very large amounts of data that cannot be stored in main memory. The lecture provides the key ideas of similarity search using minhashing and locality-sensitive hashing, of data stream processing where data arrives so fast that it has to be processed immediately or is otherwise lost, of Web-related algorithms such as Google's PageRank, of algorithms for mining frequent itemsets, association rules and frequent subgraphs, of algorithms to analyze the structure of large graphs such as social network graphs, and of the map-reduce principle to design parallel algorithms. - Finding Similar Items - Stream Data Analysis - PageRank - MapReduce - Mining Frequent Itemsets - Mining Frequent Subgraphs - Mining Social Network Graphs - Recommender Systems ==== Literature ==== * A. Silberschatz, H. F. Korth, S. Sudarshan, „Database System Concepts“, 5th edition, McGraw Hill, 2006. * R. Elmasri und S.B. Navathe, „Fundamentals of Database Systems“, 5th edition, Pearson/Addison Wesley, 2007. * William H. Inmon, "Building the Data Warehouse", John Wiley & Sons, 1996. * Jure Leskovec, Anand Rajaraman, Jeffrey David Ullman, "Mining of Massive Datasets", 2nd Edition, Cambridge University Press, 2014. * Tom White, "Hadoop: The Definitive Guide Storage and Analysis at Internet Scale", 3rd edition, O'Reilly. * Viktor Mayer-Schönberger , Kenneth Cukier , " Big Data: A Revolution That Will Transform How We Live, Work and Think", John Murray, 2013. * Eric Redmond , Jim R. Wilson, "Seven Databases in Seven Weeks: A Guide to Modern Databases and the NoSQL Movement", O' Reilly, 2012. * Peter Gulutzan, Trudy Pelzer , "SQL Performance Tuning", Addison Wesley, 2002. ==== Important Links ==== * [[https://moodle.uni-bielefeld.de/course/view.php?id=845|Moodle Course]] * [[https://uni-bielefeld.cloud.panopto.eu/Panopto/Pages/Sessions/List.aspx?folderID=383f17ff-19ef-4a96-8c07-afd900a86ca3|Lecture Recordings]] ==== Time table lecture==== | **Date** | **Topic** | |06.04.2023 | [[:teaching:2023summer:bda:lecture01|Introduction]] ({{ teaching:2023summer:bda:introduction-060423.pdf| slides}}) | |12.04.2023 | [[:teaching:2023summer:bda:lecture02|Finding Similar Items I]] ({{ teaching:2023summer:bda:lecture2-findingsimilaritems1-120423.pdf | slides}}) | |13.04.2023 | [[:teaching:2023summer:bda:lecture03|Finding Similar Items II]] ({{ teaching:2023summer:bda:lecture3-findingsimilaritems2-130423.pdf | slides}}) | |20.04.2023 | [[:teaching:2023summer:bda:lecture04|Finding Similar Items III]] ({{ teaching:2023summer:bda:lecture4-localitysensitivehashing-200423.pdf|slides}}) | |26.04.2023 | [[:teaching:2023summer:bda:lecture05|Finding Similar Items IV / MapReduce I]] ({{ teaching:2023summer:bda:lecture5-lsh2-mapreduce1-260423.pdf|slides}}) | |27.04.2023 | [[:teaching:2023summer:bda:lecture06|Map Reduce II]] ({{ teaching:2023summer:bda:lecture6-mapreduce2-270423.pdf|slides}}) | |04.05.2023 | //no lecture// | |10.05.2023 | [[:teaching:2023summer:bda:lecture07|Map Reduce III]] ({{ teaching:2023summer:bda:lecture7-mapreduce3-100523.pdf|slides}})| |11.05.2023 | [[:teaching:2023summer:bda:lecture08|MapReduce IV / Link Analysis I]] ({{ teaching:2023summer:bda:lecture8-mapreduce4-linkanalysisi-110523.pdf|slides}})| |18.05.2023 | //no lecture// | |24.05.2023 | [[:teaching:2023summer:bda:lecture09|Link Analysis II]] ({{ teaching:2023summer:bda:lecture9-linkanalysisii-240523.pdf|slides}}) | |25.05.2023 | [[:teaching:2023summer:bda:lecture10|Link Analysis III / Frequent Itemsets I]] ({{ teaching:2023summer:bda:lecture10-linkanalysisiii-frequentitemsetsi-250523.pdf|slides}})| |01.06.2023 | [[:teaching:2023summer:bda:lecture11|Frequent Itemsets II]] ({{ teaching:2023summer:bda:lecture11-frequentitemsetsii-010623.pdf|slides}}) | |07.06.2023 | [[:teaching:2023summer:bda:lecture12|Recommendation Systems]] ({{ teaching:2023summer:bda:lecture12-recommendationsystems-070623.pdf|slides}}) | |08.06.2023 | //no lecture// | |15.06.2023 | [[:teaching:2023summer:bda:lecture13|Mining Data Streams I]] ({{ teaching:2023summer:bda:lecture13-miningdatastreams1-150623.pdf|slides}}) | |21.06.2023 | [[:teaching:2023summer:bda:lecture14|Mining Data Streams II]] ({{ teaching:2023summer:bda:lecture14-miningdatastreams2-210623.pdf|slides}}) | |22.06.2023 | [[:teaching:2023summer:bda:lecture15|Mining Data Streams III / Social Networks I]] ({{ teaching:2023summer:bda:lecture15-miningdatastreams3-socialnetworks1-220623.pdf|slides}}) | |29.06.2023 | [[:teaching:2023summer:bda:lecture16|Social Networks II]] ({{ teaching:2023summer:bda:Lecture16-SocialNetworks2-290623.pdf|slides}}) | |05.07.2023 | [[:teaching:2023summer:bda:lecture17|Social Networks III Support Vector Machines I]] ({{ teaching:2023summer:bda:Lecture17-SocialNetworks3-SVM1-050723.pdf|slides}}) | |06.07.2023 | [[:teaching:2023summer:bda:lecture18|Support Vector Machines II]] ({{ teaching:2023summer:bda:Lecture18-SVM2-060723.pdf|slides}}) | ==== Time table tutorials==== | **Date** | |05./06.04.2023| |19./20.04.2023| |03./04.05.2023| |17.05.2023| |31.05./01.06.2023| |14./15.06.2023| |28./29.06.2023| ==== Examination dates ==== ===1st Exam:=== TBD ===2nd Exam:=== TBD