INFO 5368

Global toggle of class tabs

Links for textbooks and Cornell Store open in new tab.

INFO 5368

Course information provided by the Courses of Study 2023-2024.

This course provides hands-on experience developing and deploying foundational machine learning algorithms on real-world datasets for practical applications (e.g., healthcare, computer vision). Students will learn about the machine learning pipeline end-to-end including dataset creation, pre- and post-processing, annotation, annotation validation, preparation for machine learning, training and testing a model, and evaluation. Students will focus on real-world challenges at each stage of the ML pipeline while handling bias in models and datasets. Lastly, students will analyze the strengths and weaknesses of regression, classification, clustering, and deep learning algorithms.

When Offered Spring.

Prerequisites/Corequisites Prerequisite: recommended coursework in Python Programming

Outcomes
  • Collect a new dataset and prepare it for a ML task, train a model, and evaluate it.
  • Apply regression, classification, clustering, and deep learning algorithms to practical applications.
  • Analyze and identify key differences in regression, classification, clustering, and deep learning algorithms.
  • Understand core challenges of dataset creation including handling missing data, bias, unlabeled data, among others.
  • Represent features in datasets to be used for ML tasks.
  • Evaluate model quality using appropriate metrics of performance

View Enrollment Information

Syllabi: none
  •   Regular Academic Session. 

  • 3 Credits Graded

  •  9833 INFO 5368   LEC 030

  • Taught in NYC at Cornell Tech. Enrollment Limited to Cornell Tech Students only.