GitHub - NYP-AI/Learning-Materials: A Public Repository containing all of NYP AI’s past event materials
A Public Repository containing all of NYP AI’s past event materials - GitHub - NYP-AI/Learning-Materials: A Public Repository containing all of NYP AI’s past event materials

NYP AI Summer Camp 2020 is the first of its kind student-led bootcamp for Artificial Intelligence.

Through 4 days of accelerated learning of AI, students learnt an immense amount of AI theory and implementation. Students went from being complete beginners to being able to Predict House Prices, Preprocess Unclean datasets and even classify Spam messages. Delving into the fields of prediction, classification and Natural Language Processing (NLP), it proved to be a tough yet fulfilling journey for many students.

Day 1: AI Essentials + Regression

Day 1...

Being the first day of our Summer Camp, we had to ensure students had a solid understanding of Machine Learning, which would aid in their learning for the next few days.

What is ML?

Students were introduced to popular Machine Libraries in Python, namely: Scikit-learn, numpy and pandas. They were taught how to implement these libraries using python code.

Train and Predict in 3 lines of code?

As students were being introduced to predictive algorithms using Machine Learning, they were taught about the inner workings of Linear Regression, a popular Regressional Algorithm....

TL;DR Linear Regression
Our Slides showing an overview of the Machine Learning Pipeline

...And went on to predict housing prices using their newly acquired skills.

Day 2: Image Classification

Day 2...

Students were taught about one of the most popular classification algorithms: Logsitic Regression.

Multiclass classification with Logistic Regression

They went on to learn how to preprocess images, converting images into an array of numbers.

Converting an RGB images to a 3D numpy array

Having learnt the basics of Image Preprocessing, we went on to preprocess our own weather dataset, which consisted of 4 different weather types. Using this preprocessed dataset, we went to train our own weather classifier using Logsitic Regression.

Code to preprocess the dataset
Code for the preprocessing of image to predict

Day 3: Data Pre-Processing

Day 3....

Unfortunately, Real-World Data isn't as clean as we wish them to be. Students learnt how to deal with Dataset Problems, namely: Missing Values and Categorical Values.

Day 3 Lesson Plan
Imputation with mean

Students then went on to preprocess the Titanic dataset, a popular competition hosted on Kaggle.

Titanic pipeline for pre-processing

Want to use a different algorithm? No problem!

Random Forest...

Day 4: NLP & Fine Tuning

Day 4...

How would be let our Machine Learning Algorithm learn for text? Tokenization and Count Vectorizer, here we come...

Tensorflow...for text pre-processing
Count Vectorizer

Putting all these pre-processing to use, we created a text classifer using Logistic Regression, which would classify tweets as Disaster or Not Disaster.

Let's classify Disaster Tweets

Let's talk about hyperparameters, parameters which affect our model's performance...

So many choices, so little time...

With so many parameters to tune, how will we find the optimal combinations?

GridSearch saves the day

Day 5: Spam Classification Competition

What's a bootcamp without a competition to end it off?

Kaggle Competition

Students went to form teams of up to 4. This 2 day competition saw participants using the skills acquired from the past 4 days: Pre-processing, NLP and classification.

Congrats to our team teams!

NYP AI Summer Camp 2020 was indeed a new experience for all of us. We're extremely glad to have this opportunity to organize such an event, to have the privilege to teach over 70 students about AI.

Thanks for making Summer Camp a success!

We're always exploring new aspects of AI to teach, always looking for opportunities to push for the widespread education of AI. Do keep a lookout for our future events!

If you are interested in potential collaborations with NYP AI, we'd like to hear from you: https://about.nyp.ai/collaborate :)

Till then~