Skip to content
    Latest

    Data Exploration with Pandas Profiler and D-Tale

    We all have heard how data is the new oil. I always say that if that is the case, we need to go through some refinement process before that raw oil is converted into useful...

    New G3 Instances in AWS - Worth it for Machine Learning?

    We benchmarked AWS’s new G3 instances for deep learning tasks and found they significantly outperform the older P2 instances. The new G3 instances...

    Recommender Systems through Collaborative Filtering

    This is a technical deep dive into the collaborative filtering algorithm and how to use it in practice. From Amazon recommending products you may be...

    Data Science in the Enterprise: Insights from eBay, Stitch Fix, Teleon Health, and RISELab

    We recently hosted a panel discussion with several data science leaders about organizational design and tooling for enterprise data science. Watch...

    Scaling Machine Learning to Modern Demands

    This is a Data Science Popup session by Hristo Spassimirov Paskov, Founder & CEO of ThinkFast.

    Horizontal Scaling for Parallel Experimentation

    The amount of time data scientists spend waiting for experiment results is the difference between making incremental improvements and making...

    Sampling Based Methods for Class Imbalance in Datasets

    Imagine you are a medical professional who is training a classifier to detect whether an individual has an extremely rare disease. You train your...

    Git Integration in Domino

    We recently released new functionality that provides first-class integration between Domino and git. This post describes the new feature, and...

    Succeeding with Alternative Data and Machine Learning

    Perhaps the biggest insight in feature engineering in the last decade was the realization that you could predict a person's behavior by understanding...

    The Cost of Doing Data Science on Laptops

    At the heart of the data science process are the resource intensive tasks of modeling and validation. During these tasks, data scientists will try...

    Numenta Anomaly Benchmark: A Benchmark for Streaming Anomaly Detection

    With sensors invading our everyday lives, we are seeing an exponential increase in the availability of streaming, time-series data. Finding anomalies...

    Principles of Collaboration in Data Science

    Data science is no longer a specialization of a single person or small group. It is now a key source of competitive advantage, and as a result, the...

    Building a Model is the Least Important Part of Your Job

    In this Data Science Popup session, Kimberly Shenk, Director of Data Science Solutions at Domino Data Lab, explains why building models is the least...

    AI in the Enterprise: Making Corporations Smart Again

    In this Data Science Popup session, Danny Lange, VP of AI and Machine Learning at Unity Technologies, gives an inside look at practical applications...

    Deep Learning on GPUs without the Environment Setup in Domino

    We have seen an explosion of interest among data scientists who want to use GPUs for training deep learning models. While the libraries to support...

    Data Science at Instacart: Making On-Demand Profitable

    In this Data Science Popup session, Jeremy Stanley, VP of Data Science at Instacart, gives an inside look at how Instacart uses data science to...

    Practical Data Science at Gusto and General Assembly

    In this Data Science Popup panel led by Michael Manapat, Product and Machine Learning at Stripe, we learn about practical applications of data...

    Subscribe to the Data Science Blog

    Receive data science tips and tutorials from leading Data Scientists right to your inbox.