Intendierte Lernergebnisse
The successful student will have a deeper understanding of the challenges imposed by Big Data and know state of the art data engineering methods and techniques focusing on big data applications.
Lehrmethodik
The VC will be a mixture of a classical lecture, presentations of assignment solutions and student presentations.
Inhalt/e
Introduction to Big Data, Data Engineering and Data Science.Recap on RDBMS and common file formats. Managing XML and JSON in RDBMS. Advanced SQL queries.Scaling of RDBMS. Data WarehousesBig Data FrameworksMapReduceApache SparkSQL on Big Data Architectures(Big) Data IntegrationData Provenance and Data QualityData LakesUsed Programming Languages: Java, Scala. Scala assignments can optionally be solved in Python.
Erwartete Vorkenntnisse
Relational Databases (Lecture "Datenbanken"), Java Programming
Literatur
Principles of Database Management: The Practical Guide to Storing, Managing and Analyzing Big and Small Data. Cambridge University Press New York, NY, USA ©2018 ISBN:1107186129 9781107186125