Apr 18, 2024  
College Catalog 2021-2022 
    
College Catalog 2021-2022 [ARCHIVED CATALOG]

DS 420 - Big Data

4.00 credits.
This course covers techniques needed to collect, store, analyze, and visualize big data, particularly for applications in machine learning. The MapReduce paradigm will be taught using the popular Hadoop framework. Both batch and real-time analysis of massive quantities of data will be applied to machine learning problems such as clustering, regression, and classification. Although the relational database model will be discussed, NoSQL models will have primary focus. *Prerequisite(s): DS 200  and CS 209 .