Great to see they have a nice introductory section to feature engineering! Featu...

fnl · on March 1, 2018

Your comment got me interested in this course. However, all I could find about feature engineering there is what you linked to, directly.

Given that entire scientific careers, books, and conferences are built around the topic of feature engineering, and at least IMO good ML tools live or die with good feature engineering (in its broadest sense, for you deep learning fanatics :-)) that doesn't seem like more than the bare minimum I'd expect from any ML "crash-course" that is to be taken serious (and I wouldn't expect an ounce less from Google... :-)).

Am I missing something, maybe?

In any case, nice work of your own, and thanks for sharing it!

kmax12 · on March 2, 2018

10 seconds into the video of feature engineering they say that feature engineering takes up about 75% of the time https://developers.google.com/machine-learning/crash-course/...

They understand the value, but but if you keep watching, they don’t seen go beyond the basic.

minimaxir · on March 1, 2018

Although I'm normally skeptical of AI/ML courses, that section on feature engineering do's-and-do-nots is new and surprisingly under-discussed. It's very useful even outside of AI/ML.

kmax12 · on March 1, 2018

I agree.

I expect that as companies increase their focus on finding practical applications of ML / AI, the topic will start to get more attention in these tutorials, as well as from researchers. Right now, too many people assume you already have a feature matrix, which is rarely the case when working on real world problems.

cjalmeida · on March 1, 2018

OTOH, automating feature engineering is a thing. There are papers on using unsupervised methods to do that.

The 1st place in Kaggle's Porto Seguro competition trained an Autoencoder on raw data to extract features.

cuchoi · on March 2, 2018

How do you select features created with featuretools? The problem with automated feature engineering is that you end with too many irrelevant features, and I haven't found a good guide on feature selection.