Machine learning in Planetary Science: compressing Pluto images with scikit-learn and PCA

Posted on October 19, 2016 by matteomycarta

In a previous post I showed some of the beautiful new images of Pluto from New Horizon’s mission, coloured using the new Matplotlib perceptual colormaps:

More recently I was experimenting with Principal Component Analysis in scikit-learn, and one of the things I used it for was compression of some of these Pluto images. Below is an example of the first two components from the False Color Pluto image:

You can take a look at the Python code available on this Jupyter Notebook. There are of course better ways of compressing images, but this was a fun way to play around with PCA.

In a follow-up post I will use image registration and image processing techniques to reproduce from the raw channels NASA’s Psychedelic Pluto Image.

Machine learning in geoscience with scikit-learn. Part 1: checking, tidying, and analyzing the dataset

Posted on October 8, 2016 by matteomycarta

The idea behind this series of articles is to show how to predict P-wave velocity, as measured by a geophysical well log (the sonic), from a suite of other logs: density, gamma ray, and neutron, and also depth, using Machine Learning.

The log suite is from the same well that Alessandro Amato del Monte used in the Seismic Petrophysics Notebook accompanying his Geophysical tutorial article on The Leading Edge.

I will explore different Machine Learning methods from the scikit-learn Python library and compare their performances.

To wet your appetites, here’s an example of P-wave velocity, Vp, predicted using a cross-validated linear model, which will be the benchmark for the performance of other models, such as SVM and Random Forest:

In the first notebook, which is already available on GitHub here, I show how to use the Pandas and Seaborn Python libraries to import the data, check it, clean it up, and visualize to explore relationships between the variables. For example, shown below is a heatmap with the pairwise Spearman correlation coefficient between the variables (logs):

Stay tuned for the next post / notebook!

PS: I am very excited by the kick-off of the Geophysical Tutorial (The Leading Edge) Machine Learning Contest 2016. Check it out here!

Machine learning in geoscience and planetary science with scikit-learn: series outline

Posted on October 1, 2016 by matteomycarta

Machine learning in geoscience with scikit-learn. Part 1: checking, tidying, and analyzing the dataset

Machine learning in Planetary Science: compressing Pluto images with scikit-learn and PCA

Machine Learning in Geoscience with Scikit-learn. Part 2: inferential statistics and domain knowledge to select features for oil prediction

Machine learning in geoscience with scikit-learn. Part 3: the SEG ML contest

Machine Learning in Geoscience with Scikit-learn. Part 4: TBE

Looking for opportunities

Posted on September 16, 2016 by matteomycarta

As of yesterday, I no longer have a full-time day job.

I am looking for opportunities.

I’d love to hear about projects in geophysics, computational geoscience, data science, machine learning. Feel free to get in touch with me at matteo@mycarta.ca.

Thanks,

Matteo

MyCarta

A blog about Geoscience, Visualization, Data Science, AI

Tag Archives: machine learning

Machine learning in Planetary Science: compressing Pluto images with scikit-learn and PCA

Machine learning in geoscience with scikit-learn. Part 1: checking, tidying, and analyzing the dataset

Machine learning in geoscience and planetary science with scikit-learn: series outline

Looking for opportunities

Share this:

Share this:

Share this:

Share this: