Description

Storing, indexing, accessing and processing techniques for big data. Map/Reduce algorithm and related technologies. Data analysis and application in big data ecosystem. This course covers the basics of big data processing and analysis; answers the following questions: what the big data is and its differences from the traditional data and traditional processing methods, where it is used, how it is used.

General Information

Lectures
Tue 18:00 – 21:00 (Balgat Campus, A-307)
Textbook
1) Data-Intensive Text Processing with MapReduce
Jimmy Lin and Chris Dyer. Morgan & Claypool Publishers, 2010.
http://lintool.github.io/MapReduceAlgorithms/
2) Mining Massive Data Sets, 2nd Ed.
Jure Leskovec, Anand Rajaraman, Jeff Ullman
http://mmds.org
3) Mastering Apache Spark
Jacek Laskowski
https://www.gitbook.com/book/jaceklaskowski/mastering-apache-spark/details
4) Hadoop: The Definitive Guide, 3rd Edition
Tom White, O'Reilly
http://shop.oreilly.com/product/0636920021773.do

Announcements

Spark seminar
5/02/18 1:48 PM

Register 

Title: An Expert’s Guide to Apache Spark™

Date: Wednesday, May 23, 2018

Time: 09:00 AM Pacific Daylight Time

Duration: 1 hour

proposal.pdf has been added to class homepage under Resources
3/23/18 12:36 AM

The teaching staff has posted a new project resource.

Title: proposal.pdf
http://www.piazza.com/class_profile/get_resource/jde8xt6ku6rsq/jf31e0f21bv3mj

Due date: Apr 5, 2018

You can view it on the course page: https://piazza.com/cankaya.edu.tr/spring2018/ceng686/resources

Staff Office Hours
NameOffice Hours
Erdoğan Doğdu
When?
Where?