Graduate Data Science Course Descriptions


DS-500. Introduction to R Programming. 0 Credits.

This is an introductory course that covers statistical computing, which includes programming in R, reading data into R, accessing R packages, writing R functions, debugging, profiling R code, and organizing and commenting R code. Student may also work on a hands-on project, and apply functions, read/write files, and measure execution times, and finally present their computing prototype in class.

DS-510. Introduction to Data Science. 3 Credits.

Data Science is a set of fundamental principles that guide the extraction of valuable information and knowledge from data. This course provides an overview and develops student's understanding of the data science and analytics landscape in the context of business examples and other emerging fields. It also provides students with an understanding of the most common methods used in data science. Topics covered include introduction to predictive modeling, data visualization, probability distributions, Bayes' theorem, statistical inference, clustering analysis, decision analytic thinking, data and business strategy, cloud storage and big data analytics.

DS-520. Data Analysis and Decision Modeling. 3 Credits.

This course will provide students with an understanding of common statistical techniques and methods used to analyze data in business. Topics covered include probability, sampling, estimation, hypothesis testing, linear regression, multivariate regression, logistic regression, analysis of variance, categorical data analysis, Bootstrap, permutation tests and nonparametric statistics. Students will learn to apply statistical techniques to the processing and interpretation of data from various industries and disciplines.

DS-530. Big Data and Data Mangement. 3 Credits.

This course explores foundational concepts of relational databases, data warehousing, distributed data management, structured and unstructured data, NoSQL data stores and graph databases. Various database concepts are discussed including Extract-Transform-Load, cloud-based online analytical processing (OLAP), data warehouse architecture, development and planning, physical database design, data pipelines, metadata, data provenance, trust and reuse. Students will develop practical experience using SQL.

DS-540. Statistical Programming. 3 Credits.

The course gives an introduction to SAS or R programming for statistical analyses and managing, analyzing and visualizing data. Topics include numeric and non-numeric values, arithmetic and assignment operations, arrays and data frames, special values, classes and coercion. Students will learn to write functions, read/write files, use exceptions, measure execution times, perform sampling and confidence analyses, plot a linear regression. Students will explore tools for statistical simulation, large data analysis and data visualization, including interactive 3D plots.

DS-550. Financial Computing and Analytics. 3 Credits.

Financial Computing and Analytics focuses on methods and algorithms for financial simulation and optimization, and encompasses computational finance including mathematical modeling, quantitative risk management and market simulation. The course provides the students with the skills for cutting edge quantitative careers in the finance industry. It also covers algorithm design and development, high performance computing and software engineering for financial applications. In addition to practical experience in novel computational methods and platforms, students are exposed to a selection of research topics that may include algorithmic trading, stochastic processes, quantitative risk models, financial econometrics, computational statistics, simulation, network and statistical signal processing.

DS-600. Data Mining. 3 Credits.

Data mining refers to a set of techniques that have been designed to efficiently find important information or knowledge in large amounts of data. This course will provide students with understanding of the industry standard data mining methodologies, and with the ability of extracting information from a data set and transforming it into an understandable structure for further use. Topics covered include decision trees, classification, predictive modeling, association analysis, statistical modeling, Bayesian classification, anomaly detection and visualization. The course will be complemented with hands-on experience of using advanced data mining software to solve realistic problems based on real-world data.

DS-610. Big Data Analytics. 3 Credits.

Big Data (Structured, semi-structured, & unstructured) refers to large datasets that are challenging to store, search, share, visualize, and analyze. Gathering and analyzing these large data sets are quickly becoming a key basis of competition. This course explores several key technologies used in acquiring, organizing, storing, and analyzing big data. Topics covered include Hadoop, unstructured data concepts (key-value), Map Reduce technology, related tools that provide SQL-like access to unstructured data: Pig and Hive, NoSQL storage solutions like HBase, Cassandra, and Oracle NoSQL and analytics for big data. A part of the course is devoted to public Cloud as a resource for big data analytics. The objective of the course is for students to gain the ability to employ the latest tools, technologies and techniques required to analyze, debug, iterate and optimize the analysis to infer actionable insights from Big Data.

DS-620. Data Visualization. 3 Credits.

Visualization concerns the graphical depiction of data and information in order to communicate its contents and reveal patterns inherent in the data. It is sometimes referred to as visual data mining, or visual analytics. Data visualization has become a rapidly evolving science. This course explores the underlying theory and practical concepts in creating visual representations of large amounts of data. Topics covered include data representation, information visualization, real-time visualization, visualization toolkits including Tableau and their applications to diverse data rich contexts. At the end of the course, the student will be able to present meaningful information in the most compelling and consumable fashion.

DS-630. Machine Learning. 3 Credits.

Machine learning is the field of study that gives computers the ability to learn from experience without being explicitly programmed. This course covers the theory and practical algorithms for machine learning from a variety of perspectives. Topics include decision tree learning, parametric and non-parametric learning, Support Vector Machines, statistical learning methods, unsupervised learning, reinforcement learning and the Bootstrap method. Students will have an opportunity to experiment with machine learning techniques and apply them to solve a selected problem in the context of a term project. The course will also draw from numerous case studies and applications, so that students learn how to apply learning algorithms to build machine intelligence.

DS-640. Predictive Analytics and Financial Modeling. 3 Credits.

Predictive analytics is an area of data mining that deals with extracting information from data and using it to predict trends and behavior patterns. This course will provide predictive analytics foundational theory and methodologies as well as teach students how to build predictive models for practical financial and business applications and verify model effectiveness. Topics covered are linear modeling and regression, nonlinear modeling, time series analysis and forecasting, segmentation and tree models, support vector machine, clustering, neural networks and association rules.

DS-650. Data Law, Ethics and Business Intelligence. 3 Credits.

The increasing use of big data in our society raises legal and ethical questions. Business intelligence is the process of collecting and transforming raw data into meaningful and useful information for business purposes. This course explores the issues of privacy, data protection, non-discrimination, equality of opportunities and due process in the context of data-rich environments. It analyzes ethical and intellectual property issues related to data analytics and the use of business intelligence. Students will also learn the legal obligations in collecting, sharing and using data, as well as the impact of algorithmic profiling, industrial personalization and government. This course also provides an understanding of the important capabilities of business intelligence, the technologies that enable them and the management of business intelligence.

DS-660. Business Analytics. 3 Credits.

Business analytics is the process of generating and delivering the information acquired that enables and supports an improved and timely decision process. The aim of this course is to provide the student with an understanding of a broad range of decision analysis techniques and tools and facilitate the application of these methodologies to analyze real-world business problems and arrive at a rational solution. Topics covered include foundations of business analytics, descriptive analytics, predictive analytics, prescriptive analytics, and the use of computer software for statistical applications. The course work will provide case studies in Business Analytics and present real applications of business analytics. Students will work in groups to develop analytic solutions to these problems.

DS-670. Capstone: Big Data and Business Analytics. 3 Credits.

This course is structured as a capstone research practicum where students have an opportunity to apply the knowledge acquired in data science to interdisciplinary problems from a variety of industry sectors. Students work in teams to define and carry out an analytics project from data collection, processing and modeling to designing the best method for solving the problem. The problems and datasets used in this practicum will be selected from real world industry or government settings. At the end of the class students will write a report that presents their project, the approach and techniques used to design a solution, followed by results and conclusion. Students are encouraged to present their capstone research at conferences.

DS-680. Marketing Analytics and Operations Research. 3 Credits.

Organizations need to interpret data about consumer choices, their browsing and buying patterns and to match supply with demand in various business settings. This course examines the best practices for using data to prescribe more effective business strategies. Topics covered include marketing resource allocation, metrics for measuring brand assets, customer lifetime value, and using data analytics to evaluate and optimize marketing campaigns. Students learn how data is used to describe, explain, and predict customer behavior, and meet customer needs. Students also learn to model future demand uncertainties, predict the outcomes of competing policy choices and take optimal operation decisions in high and low risk scenarios.

DS-690. Data Science and Health. 3 Credits.

Students will be introduced to the types of data commonly used in public health, biomedical and clinical settings. Students will acquire the knowledge and skills to use these data for understanding and improving the quality of health outcomes. Through lectures and class data analysis projects, students will explore, analyze and create graphical visualization of data from a variety of healthcare sources. Students will also be exposed to selective topics on real time analytics, clinical informatics, and machine learning for biomedical applications.