Course ID: | MIST 5635/7635. 3 hours. |
Course Title: | Machine Learning and Business Analytics |
Course Description: | Topics in supervised learning, unsupervised learning, dimension reduction, and feature selection. Course covers multiple methods (e.g., regression, tree-based models, and deep-learning models) and their implementation in the R computing environment. An emphasis is placed on rigorously training and testing models to achieve high reliability and accuracy. |
Oasis Title: | Machine Learning Bus Analytics |
Undergraduate Prerequisite: | (BUSN 4000 or BUSN 4000E with a minimum grade of C) and (MIST 4600 or MIST 4600E with a minimum grade of C) |
Graduate Prerequisite: | (BUSN 4000 or BUSN 4000E with a minimum grade of C) and (MIST 4600 or MIST 4600E with a minimum grade of C) |
Undergraduate Pre or Corequisite: | MIST 5620 |
Graduate Pre or Corequisite: | MIST 7770 |
Semester Course Offered: | Offered every year. |
Grading System: | A-F (Traditional) |
|
Course Objectives: | Concepts
• Fundamentals of machine learning, including cross-validation, loss functions and model fit, and feature selection
• Tradeoffs in machine learning design (bias-variance tradeoff, overfitting vs. underfitting, parameter tuning)
• Supervised learning for both regression and classification problems
• Unsupervised learning and dimension reduction
• Use of machine learning models for prescriptive applications in business
• Deployment and maintenance of machine learning models
Techniques
• Supervised learning in R (linear regression, logistic regression, decision trees, SVM, neural networks)
• Unsupervised learning in R (centroid clustering, mixture models)
• Conducting cross-validation as part of a machine learning training pipeline
• Conducting simulations to test alternative policy choices
• Implementing a cloud-based machine learning pipeline |
Topical Outline: | Module 1: Basics of Machine Learning
• Introduction to loss functions, features, labels, and parameters
• Basics of linear algebra, calculus, and probability
• Training versus testing data and cross-validation
• Bias-variance tradeoff, overfitting, and underfitting
Module 2: Regression Models
• Linear regression in R
• Interaction terms and dummy variables
• Generalized linear models
• Regularization
Module 3: Classification Models
• Logistic regression
• Tree-based models
• Support vector machines
• Neural networks
Module 4: Unsupervised Methods
• Centroid methods such as K-means
• Mixture models
• Dimension reduction, including PCA and factor analysis
Module 5: Prescriptive Models
• Basic optimization concepts, with an introduction to linear programming
• Simulation in R, including Monte Carlo methods
Module 6: Deployment
• Basic design principles of a machine learning pipeline
• Integrating machine learning models into a cloud environment
• Best practices for maintaining and updating the pipeline |
Honor Code Reference: | As a University of Georgia student, you have agreed to abide by the University's academic honesty policy, "A Culture of Honesty," and the Student Honor Code. All academic work must meet the standards described in "A Culture of Honesty." Lack of knowledge of the academic honesty policy is not a reasonable explanation for a violation. Questions related to course assignments and the academic honesty policy should be directed to the instructor. |