Course offerings reflect the 2018-2019 Elmhurst College Catalog. One unit of credit equals four semester hours.
Topics include an integrated and detailed comparison of relational, hierarchical and network database systems. Database design and physical storage requirements, including distributed database design and related management issues, are discussed. High-level query languages using artificial intelligence techniques are reviewed along with other topics such as database compression, encryption and security.
Business intelligence represents a conceptual framework for decision support. It combines analytics, data warehouses, applications and methodologies to facilitate the transformation of data into meaningful and functional information. The major objective of business intelligence is to enhance the decision-making process at all levels of management. Data mining is a process that utilizes statistical analysis, probability theory, mathematical modeling, artificial intelligence and machine learning techniques to extract useful information and subsequent knowledge from large data repositories, commonly referred to as “big data.” This course examines a number of emerging methods proven to be of value in recognizing patterns and making predictions from an applications perspective. Students will be provided the opportunity for hands-on experimentation using software and case studies.
This course covers the application of appropriate high-level programming languages for expressing software design patterns used for extracting and processing big data within the Hadoop environment. These high-level languages include imperative, object-oriented languages such as PIG, HIVE and Scala. Examples will also be presented in Java and Python. The languages will be presented in support of big data processing relying on the map-reduce paradigm. Additional libraries will be explored in order to support activities of data mining as well as machine learning.
The ability to move data along the continuum from information to insight to action requires a strong foundation of skills in various quantitative methods. This course begins with a systematic and integrated overview of concepts from probability theory, statistics and mathematical modeling such as probability distributions, cumulative probability distributions, descriptive statistics, hypothesis testing, correlation analysis, linear regression, multivariate regression and mathematical model design. The course then proceeds to examine modern tools for conducting analyses using these quantitative methods on both small-scale and large-scale datasets. Case studies from a variety of settings are used to develop students’ abilities to successfully apply the techniques learned in this course to practical circumstances that often, because of the ambiguities involved, present limitations to the power of these mathematical tools. Topics from this course also provide the foundation for some subjects covered in the analytical methods course and the data mining and business intelligence course.
Each student completes a project incorporating the practical application of several of the program’s data mining techniques to one or more data sets chosen by the student or provided by the instructor. In addition to the correct use of the techniques and interpretation of the results, emphasis is placed on the student’s ability to gauge the resultant impact on the organization’s business intelligence processes and procedures. Prior to the submission of the final project, students submit a proposal describing the application and the data mining tools to be utilized.
This course builds upon the foundation established in the quantitative methods course to develop the advanced analytical methods required for in-depth applications of data science. Topics covered include advanced techniques in statistics and predictive modeling such as feature engineering, variations of multilinear and logistic regression, principal component analysis, advanced approaches to clustering and segmentation, time series forecasting, and biological methods such as neural networks and genetic algorithms. Course topics will be introduced from both a theoretical framework and through the use of case studies in applied settings.
This course provides a business-oriented framework for the data scientist to identify, prioritize and perform data analytical projects that drive business value and enhance competitive advantage. The course examines the data ecosystem, both external and internal to the enterprise, as well as business processes and networks upon which analytical projects can be used to reduce organizational risk and drive the creation of economic value. Topics covered include: marketing, sales and customer data exploration, supply chain data exploration, operations data exploration, financial data exploration and project management methods to convert information gain into business value. A variety of business process data sets will be examined using analytical tools, including Tableau, SAP Predictive Analytics and IBM Watson. Students will conduct the steps of an analytical workflow on a selected business process and create a project plan to drive value.
This course focuses on combining data science with a student’s prior project management training and experience to raise project management maturity in an organization. The course covers approaches for using data science tools and techniques to optimize quality, minimize risk and ensure vigilant decision making across projects in an enterprise portfolio. Topics covered include Monte Carlo methods for methodology evaluation, earned value analysis and dashboards, and data-mining techniques for establishing measured, repeatable and optimized project management processes. The project management maturity roadmap and the role of the PMO are also discussed in depth. Prerequisites: MDS 546, MDS 534, MPM 501, MPM 502. Students who are certified project management professionals can waive the project management course requirements.
This course covers advanced project management strategies and data science techniques needed for managing large scale organizational change and business transformation projects. Students plan and control each phase of a project implementation life cycle containing multiple streams of work. Emphasis is placed upon the use of data analysis and reporting for decision making in the complex quality and risk management situations encountered in technology based, organizational change projects. Prerequisite: MDS 561.
This course emphasizes the application of the primary topics covered in MDS 534 Data Mining and Business Intelligence and MDS 556 Analytical Methods within large case studies while learning to choose the appropriate programming language(s), software design pattern(s) and/or software tools, which are covered in CS 535 Programming Models and Environments. In these case studies students utilize data mining tools where appropriate and utilize advanced techniques in statistics and mathematical modeling for supporting conclusions and decisions. Students utilize software tools to visually present conclusions and decisions. Case studies are chosen from a wide spectrum of problem domains.
This course provides opportunities for students to study specialized topics in data science. Examples include time series forecasting and data science applications of linear algebra.
This course serves as the capstone for the program. Students will apply concepts learned and skills developed in the other required courses to complete a major project that demonstrates the full range of their data science knowledge and capabilities.
This course is the first of two courses required to complete a thesis option in the master’s program in data science.
This course is the second of two courses required to complete a thesis option in the master’s program in data science.
Students must complete two graduate-level electives at Elmhurst.