DATA616 DATA ENGINEERING, CURATION, AND ANALYTICS (GRADUATE, 3 CREDIT HOURS)

November 16, 2022

This course provides an experiential overview of current issues in data analytics from both analytic and computer sciences perspectives. The focus is on learning techniques for 1) scraping, cleaning, and manipulation of raw data from a variety of live sources and preparing them for analysis, 2) learning to manipulate and reorganize data to apply a variety of analytic tools, and 3) manipulate a variety of data features to enhance to predictive power of statistical models. The focus of this course will be primarily on techniques and tools used to extract data from various sources (primarily live and active data streams), prepare it for analysis, and then apply those analyses with a special emphasis on the understanding of feature engineering: the process of creating representations of data that increase the effectiveness of a predictive model.

Prerequisites: DATA600 Data Analysis using R, DATA604 Python Programming and Data Management; DATA610 Introduction to Data Mining Core courses.