Time | Mon to Thu 9:50am-11:30am |
Location | Snell Library 033 |
Please use Piazza to ask questions which can potentially be answered by your peers (e.g., setting up the programming environment, Amazon account creation, slide content). If you want to send me a private message to the instructor, you can either use Piazza or email. If you wish, you can also leave anonymous feedback via this anonymous Google form, which only the instructor can see.
tziavelis.n (at) northeastern (dot) edu | |
Web | https://ntzia.github.io/ |
Office Hours | Tuesdays 2-5pm (Zoom link on Canvas) |
saranyan.s (at) northeastern (dot) edu | |
Web | https://ssantosh1999.wixsite.com/eportfolio/ |
Office Hours | Thursdays 2-5pm (Zoom link on Canvas) |
neema.r (at) northeastern (dot) edu | |
Web | https://ronhitneema.github.io/myportfolio.github.io/ |
Office Hours | Fridays 2-5pm (Zoom link on Canvas) |
hawal.s (at) northeastern (dot) edu | |
Office Hours | Mondays 5-8pm (Zoom link on Canvas) |
This really is an algorithms course at heart. You will write plenty of code, but the main emphasis is on learning how to approach big-data analysis problems. You will need solid Java or Python programming skills to succeed, but we are not teaching any Java/Python basics in this course. You do not need advanced Scala skills and should be able to pick up what you need on-the-fly with reasonable effort.
If you believe that programming in Java or Scala presents an insurmountable barrier for you, contact the instructor as soon as possible to find a solution. It is possible to program in other languages, but we generally cannot promise any support for them—so you may be on your own if you get stuck. Students in the past completed their homework successfully using Python for both MapReduce and Spark. Python is well supported in Spark and the programs often look similar to those written in Scala.
You should only take this course if you are prepared to deal with such issues and are willing to put in extra time when necessary. Do not take this course if you want a well-polished and well-tested course without any uncertainty. If you are genuinely interested in the topic and are ready to work around the inevitable frustrations, then this will be a rewarding experience.
This course has been designed and taught by Mirek Riedewald. We will for the most part follow the same structure and use the same material. For the course web page design, credit goes to Nate Derbinsky and to Wolfgang Gatterbauer for some modifications.