Machine Learning Learning Path
Published:
Link repo: vnk8071/machine-learning-learning-path
Nanodegree
NanoDegree is a series of courses in a specific field. It is a paid course. You can learn more about Nanodegree here.
Noted nanodegrees: Udacity - Google drive
Data Analyst (nd002)
- Syllabus: Syllabus
- Noted course: Google Drive
Projects:
1. Introduction to Data Analysis with Pandas and NumPy
- Learn the data analysis process of questioning, wrangling, exploring, analyzing, and communicating data. You will work with data in Python using libraries like NumPy and pandas.
- Project: Investigate a Dataset
- Source code: vnk8071/investigate_a_dataset
2. Advanced Data Wrangling
- Data wrangling is a set of processes for turning raw and messy data into a clean format to answer interesting questions from the data. In this course, you will learn the three phases of data wrangling: gathering, assessing, and cleaning data.
- Project: Real World Data Wrangling with Python
- Source code: vnk8071/advanced_data_wrangling
3. Data Visualization with Matplotlib and Seaborn
- Learn to apply sound design and data visualization principles to the data analysis process. Learn how to use analysis and visualizations to tell a story with data.
- Project: Communicate Data Findings
- Source code: vnk8071/communicate_data_findings
Intermediate Python Nanodegree (nd303)
- Syllabus: Syllabus
- Noted course: Google Drive
Projects:
1. Advanced Python Techniques
- In this course, you will learn advanced Python skills and master a myriad of modern subject matter.
- Project: Exploring Near-Earth Objects
- Source code: vnk8071/advanced_python_techniques
2. Large Python Codebases with Libraries
- Learn how you can write, structure, and extend your code to be able to support developing large systems at scale. Understand how you can leverage open source libraries to quickly add advanced functionality to your code and how you can package your code into libraries of your own. Apply Object Oriented Programming to ensure that your code remains modular, clear, and understandable. Honing these skills are the foundations for building codebases that are maintainable and efficient as they grow to tens of thousands of lines.
- Project: Motivational Meme Generator
- Source code: vnk8071/meme_generator
Full Stack Web Developer (nd0044)
- Syllabus: Syllabus
- Noted course: Google Drive
Projects:
1. SQL and Data Modeling for the Web
- You’ll learn the skills needed to traverse the stack and develop an entire database-backed web application. By the end of the course, you’ll have the fundamentals you need to start building web applications, including how to do Create, Read, Update, and Delete (CRUD) operations on a database, how to apply these operations across both databases and web applications, how to set up relationships between elements of an application, and ultimately how to think about important principles and patterns in building data models for a web application
- Project: Fyyur: Artist Booking Site
- Source code: vnk8071/sql_data_modeling_for_web
2. API Development and Documentation
- In this project, you will use the skills you’ve developed to build a Trivia API. The goal of this project is to use APIs to control and manage a web application using existing data models. You’ll be given a set of data models and the application front end. Your task will be to implement the API in Flask to make the Trivia game functional.
- Project: Trivia API
- Source code: vnk8071/api_development_documentation
3. Identity Access Management
- In this part, you will build the backend for a coffee shop application. You’ll add user accounts and authentication to your application and use role-based access management strategies to control different types of user behavior in the app.
- Project: Coffee Shop Full Stack
- Source code: vnk8071/identity_access_management
4. Server Deployment and Containerization
- Develop an understanding of containerized environments, use Docker to share and store containers, and deploy a Docker container to AWS Elastic Kubernetes Service using the CI/CD pipeline.
- Project: Deploy Your Flask App to Kubernetes Using EKS
- Source code: vnk8071/server_deployment_containerization
- Template: vnk8071/pipeline-deploy-kubernetes-on-aws
5. Full Stack Capstone
- You will now combine all of the new skills you’ve learned and developed in this course to construct a database-backed web API with user access control. You will choose what app to build and then you’ll design and build out all of the API endpoints needed for the application and properly secure them for use in any front end application (web or mobile).
- Capstone Coffee Shop Fullstack
- Source code: vnk8071/capstone_fullstack_web
Machine Learning Engineer (nd009t)
- Syllabus: Syllabus
- Noted course: Google Drive
Projects:
1.Introduction to Machine Learning
- In this course, you’ll start learning what machine learning is by being introduced to the high level concepts through AWS SageMaker. You’ll begin by using SageMaker Studio to perform exploratory data analysis. Know how and when to apply the basic concepts of machine learning to real world scenarios. Create machine learning workflows, starting with data cleaning and feature engineering, to evaluation and hyperparameter tuning. Finally, you’ll build new ML workflows with highly sophisticated models such as XGBoost and AutoGluon.
- Project: Predict Bike Sharing Demand with AutoGluon
- Source code: vnk8071/predict_bike_sharing_demand
2. Developing Your First ML Workflow
- This course discusses how to use AWS services to train a model, deploy a model, and how to use AWS Lambda Functions, Step Functions to compose your model and services into an event-driven application.
- Project: Build a ML Workflow For Scones Unlimited On Amazon SageMaker
- Source code: vnk8071/ml_workflow_for_scones_unlimited
3. Deep Learning Topics within Computer Vision & NLP
- Project: Image Classification using AWS SageMaker
- Source code: vnk8071/image_classification_using_sagemaker
4. Operationalizing Machine Learning on SageMaker
- This course covers advanced topics related to deploying professional machine learning projects on SageMaker. Students will learn how to maximize output while decreasing costs. They will also learn how to deploy projects that can handle high traffic, how to work with especially large datasets, and how to approach security in machine learning AWS applications.
- Project: Operationalizing an AWS ML Project
- Source code: vnk8071/operationalizing_aws_ml
5. Capstone Project: Build Your Own Machine Learning Portfolio
- Project: Build Your Own Machine Learning Portfolio
- Source code: vnk8071/capstone_ml_engineer
Machine Learning DevOps Engineer (nd0821)
- Syllabus: Syllabus
- Noted course: Google Drive
Projects:
1. Clean Code Principles
- Develop skills that are essential for deploying production machine learning models. First, you will put your coding best practices on auto-pilot by learning how to use PyLint and AutoPEP8. Then you will further expand your git and Github skills to work with teams. Finally, you will learn best practices associated with testing and logging used in production settings in order to ensure your models can stand the test of time.
- Project: Predict Customer Churn with Clean Code
- Source code: vnk8071/clean_code
2. Building a Reproducible Model Workflow
- This course empowers the students to be more efficient, effective, and productive in modern, real-world ML projects by adopting best practices around reproducible workflows. In particular, it teaches the fundamentals of MLops and how to: a) create a clean, organized, reproducible, end-to-end machine learning pipeline from scratch using MLflow b) clean and validate the data using pytest c) track experiments, code, and results using GitHub and Weights & Biases d) select the best-performing model for production and e) deploy a model using MLflow. Along the way, it also touches on other technologies like Kubernetes, Kubeflow, and Great Expectations and how they relate to the content of the class.
- Project: Build an ML Pipeline for Short-term Rental Prices in NYC
- Source code: vnk8071/reproducible_model_workflow
3.Deploying a ML Model to Cloud Application Platform with FastAPI
- This course teaches students how to robustly deploy a machine learning model into production. En route to that goal students will learn how to put the finishing touches on a model by taking a fine grained approach to model performance, checking bias, and ultimately writing a model card. Students will also learn how to version control their data and models using Data Version Control (DVC). The last piece in preparation for deployment will be learning Continuous Integration and Continuous Deployment which will be accomplished using GitHub Actions and Heroku, respectively. Finally, students will learn how to write a fast, type-checked, and auto-documented API using FastAPI.
- Project: Deploy a Machine Learning Model to Cloud Application Platform
- Source code: vnk8071/deploy_ml_pipeline_in_production
4. ML Model Scoring and Monitoring
- This course will help students automate the devops processes required to score and re-deploy ML models. Students will automate model training and deployment. They will set up regular scoring processes to be performed after model deployment, and also learn to reason carefully about model drift, and whether models need to be retrained and re-deployed. Students will learn to diagnose operational issues with models, including data integrity and stability problems, timing problems, and dependency issues. Finally, students will learn to set up automated reporting with API’s.
- Project: A Dynamic Risk Assessment System
- Source code: vnk8071/ml_model_scoring_and_monitoring
Contact
- Gmail: nguyenkhoi8071@gmail.com
- Website: khoispace.io.vn
- LinkedIn: linkedin.com/in/khoivn8071
- Github: github.com/vnk8071