Splice Machine Launches the Splice Machine Feature Store to Simplify Feature Engineering and Democratize Machine Learning

January 19, 2021

With a true SQL HTAP database, the Feature Store streamlines the most time-consuming and expensive task of the data science life cycle and enables ML models to integrate past and present data to predict the most accurate future

San Francisco, CA, January 19, 2021Splice Machine, the only scale-out SQL database with built-in machine learning, today announced it has launched the Splice Machine Feature Store. The solution will help more companies operationalize machine learning by reducing the complexity of feature engineering and allow data scientists to make the right decisions based on real-time data.

Despite the hype around AI and machine learning, seven out of 10 executives whose companies had made investments in artificial intelligence reported minimal or no impact from them, according to a 2019 research report from MIT. Those surveyed stated that creating a machine learning model and putting it into operation in an enterprise environment are two very different things.

“The capacity to create, share, explain and reliably reproduce features for a given model is paramount to the success of a data science team,” said Monte Zweben, CEO, Splice Machine. “The old way of doing things meant data science operations were simply not scalable. The Splice Machine Feature Store enables you to harness complex analytics in real time and transform real-time data into features, so your models are never uninformed. It also stores feature history making training set creation a single click.”

Feature engineering is the most time-consuming and expensive task of the data science life cycle. As companies work to operationalize machine learning, current approaches are not scalable because data science productivity is too low to enable widespread adoption. Simplifying the data science workflow by providing necessary architecture and automating feature serving with feature stores are two of the most important ways to make machine learning easy, accurate and fast at scale.

The Splice Machine Feature Store Platform solves some of the biggest pain points of operationalizing machine learning, including:

  • Reducing the effort of feature engineering
  • Helping to solve for governance issues, such as bias, drift, or regulatory oversight
  • Scaling data science operations
  • Reducing monetary loss from the creation of inaccurate models

This will help data scientists realize numerous benefits, including:

  • Achieving faster deployments of AI/ML into production by reusing features and avoiding duplicative feature engineering
  • Spending 80% less time on feature engineering
  • Developing more informative models via automatic aggregation of raw data
  • Gaining predictive accuracy of models with near real-time feature updates and consistent training sets

“As our Clinical Advisor application continues to be adopted by more and more clinicians, there will be increasingly more data at our disposal to help hone and create new features that can further improve our application’s precision in identifying the course of disease and improving patient outcomes,” said IPH (Innovative Precision Health) Chief Scientific Officer, Mark Gudesblatt, MD. “Splice Machine’s feature store makes it possible to leverage features developed by data scientists for a particular disease like Multiple Sclerosis to be leveraged by other data scientists for other diseases like Parkinson’s or Alzheimer’s.”

“We had this sort of a feature store at Airbnb, but it was limited by the fact that we were largely on HDFS,” said Robert Yi, CDO at Dataframe and former Airbnb data scientist. “It enabled users to share features, but it didn’t solve the online/offline problem. But the solution can obviously be much more elegant if you start with a more amenable database that can function in realtime. Splice Machine seems to be doing exactly that – MLflow integration, database re-injection, Spark lazy loading, easy deployment, and API-less access.”

Zweben added, “The Splice Machine Feature Store is the only one in the market that does not have to synchronize two underlying data engines — an online store and an offline store, simplifying the architecture with one store, lowering costs, and eliminating latency.”

To learn more about the Splice Machine Feature Store Platform, visit https://splicemachine.com/product/feature-store/.

Zweben will also discuss the importance of feature stores at the upcoming TWIMLcon 2021, being held online January 19-29, 2021. Monte’s session, “Unified MLOps: How Database Deployment and a Feature Store Can Revolutionize ML”, is slated for Wednesday, January 20, 2021 at 11:40 AM PST and a workshop, “Hands-On Feature Store and Model Deployment with Splice Machine”, as scheduled for Thursday, January 28 at 9:00 AM PST. More information about the event can be found here: TWIMLcon

About Splice Machine
Splice Machine makes it fast and easy to deploy AI applications. The company combines SQL database and machine learning technology into one seamless platform to slash software, infrastructure and people costs and to vastly improve data science productivity. Enterprises build new intelligent applications on the Splice Machine platform, extend existing applications with AI, and modernize their applications in the cloud with virtually no rewrites. The Splice Machine Livewire solution enables industrial clients to ingest live sensor data to predict operational problems in real-time to avoid costly outages. Splice Machine is available as a fully-managed cloud service on AWS, Azure, and GCP and is also available on-premises. For more information on Splice Machine, visit www.splicemachine.com.