Dec 26, 2024  
College Catalog 2022-2023 
    
College Catalog 2022-2023 [ARCHIVED CATALOG]

DS 420 - Big Data

4.00 credits.
This course covers techniques needed to collect, store, analyze, and visualize big data, particularly for applications in machine learning. The MapReduce paradigm will be taught using the popular Hadoop framework. Both batch and real-time analysis of massive quantities of data will be applied to machine learning problems such as clustering, regression, and classification. Although the relational database model will be discussed, NoSQL models will have primary focus. *Prerequisite(s): DS 200  and CS 209 . Spring semester, odd-numbered years.