ARTICLES
Data Exchange Podcast (Episode 79) Hyun Kim
This week’s guest is Hyun Kim, co-founder and CEO of Superb AI, a startup building tools to help companies manage data across the entire machine learning application lifecycle. This includes tools to label, store, and monitor data assets that power all computer vision...
Guest Post: “ML Data”: The Past, Present, and Future
In this article, co-founder and CTO of Galileo Atindriyo Sanyal gives a fascinating overview of the ‘ML data intelligence’ evolution and shares a few insights on why the organizations that obsess on their ML data quality will quickly greatly outperform those that...
High-quality data meets enterprise MLOps
According to the 2021 enterprise trends in machine learning report by Algorithmia, 83% of all organizations have increased their AI/ML budgets year-on-year, and the average number of data scientists employed has grown by 76% over the same period. However, the process...
Resilient human-in-the-loop pipelines with Pachyderm and Toloka
Why data prep is hard Many data scientists and machine learning teams report that they spend about 80% of their time preparing, managing, or curating their datasets. There are three things that have enabled the ML revival over the last 5–10 years: breakthroughs in...
Buy or Build for ML Ops
Where to get started? Let’s start with admitting that there is no 100% waterproof pros and cons list of buying vs building an ML Ops stack that works for everyone. It always depends on your specific situation. In this article, I try to highlight some of the most...
How to convert DMatrix to NumPy format for your machine learning model?
Miles to kilometers, celsius to Fahrenheit, and Google Docs to word are just some of the conversions we all do in our day-to-day life. When we fly overseas (hello pre-COVID days) or work with a company with different sharing methods, we often find ourselves converting...
Automating MLOps for Deep Learning: How to Operationalize DL with Minimal Effort
Operationalizing AI pipelines is notoriously complex. For deep learning applications, the challenge is even greater, due to the complexities of the types of data involved. Without a holistic view of the pipeline, operationalization can take months, and will require...
The Model’s Shipped; What Could Possibly go Wrong?
In our last post we took a broad look at model observability and the role it serves in the machine learning workflow. In particular, we discussed the promise of model observability & model monitoring tools in detecting, diagnosing, and explaining regressions...
Easy ML Monitoring with Toloka
Data-Driven AI Meetup 3: Easy ML monitoring with Toloka Boris Tseytlin, ML Research Scientist. In this video Toloka Boris will explain how to use Toloka to detect data drifts and model degradation. During this hands-on tutorial, you will learn how to set up dependable...
9 Reasons Why You Need an Immutable Database
An immutable database means the data within it cannot be deleted or modified. There are numerous reasons why an immutable database is beneficial for you and this article explains some of those arguments. Martin Kleppmann, who is a serial entrepreneur and...
Connect with Us
Follow US