Lehrstuhl  |  Institut  |  Fakultät  |  LMU

Hauptseminar "Big Data Tools" im WS 2015/16


  • The topics have been assigned.


"Big Data" is a term often used to describe data which potentially may provide valuable insights, but which amounts are too large to be processed by "classical" methods and tools. In practice, it is not only the major internet companies who deal with "Big Data". Nowadays, data is generated by sensors, cameras, or social networks. To conclude, there are business cases for different types of organizations. Besides of the large volume, "Big Data" is often characterized by the lack of structure, uncertainty and high speed of data generation. In the last years various algorithms were proposed for Big Data. Furthermore, various software tools were developed to handle "Big Data" processing, analysis and persistence. In the seminar, we will focus on the Big Data tools. We want to analyze currently available software products, determine use cases they are appropriate for and compare similar components. The participants will apply the tools to the use cases and report on their own experiences with the tools. The goal of the seminar is to enable each participant to assemble a "Big Data" software stack under consideration of particular requirements.

What is expected

Each participant will give two talks. The first talk introduces the topic and to the planned activities. The first talk can be videotaped for self-analysis. This way, each participant will get the possibility to discuss the plan and get feedback about his or her personal presentation style. Another goal for the first talk is to get the overview of the tools for all participants. In the second talk, the participants present their results. The final grade is based on the second talk and seminar paper grades. Talks can be held in English or German.

How to apply:

The registration is open until 21.09. After registration please write and a mail. Describe your experience with big data or data analysis (relevant work experience or courses). Please select three topics and write your priority for each of them. Please note, your registration is valid only if you send us a registration email within the registration deadline. Registration emails sent after registration deadline expiration will be ignored.


  • Cassandra and Mongo DB
  • Hbase
  • Redis, (Kafka or Rabbit MQ)
  • Apache Drill (not assigned)
  • Spark Streaming
  • Spring XD (not assgined)
  • Storm (not assigned)
  • Flink
  • Akka (not assigned)
  • Zookeeper (not assigned)
  • Mesos
  • Spark MLib
  • GraphX
  • H20
  • Docker+Vagrant (not assigned)
  • Keystone ML
  • SFrame
  • Apache Uima (DUCC)
  • (Propose your topic)



Datum TerminContent
15.10.2015Einführung slides


07.01.2016 - fällt aus

14.01.2016 - mesos, keystone ml

21.01.2016 - cassandra+mongo db, redis and kafka

28.01.2016 - apache uima, spark mlib

04.02.2016 - Ersatztermin

Abgabe der Seminararbeit

Bis zum 29. Februar

Ort und Zeit

Veranstaltung Zeit Ort Beginn
Seminar Do, 16.00-18.00 Uhr Oettingenstr. 67 027

Weiterführende Linsk

Zusätzliche Informationen

Vorhergehende Semester