• Latest
  • Trending
  • All
  • News
  • Business
  • Politics
  • Science
  • World
  • Lifestyle
  • Tech
Bringing order to data lakehouses, Onehouse is expanding its Apache Hudi technology with $25M raise

Bringing order to data lakehouses, Onehouse is expanding its Apache Hudi technology with $25M raise

February 2, 2023
Sam Asghari addresses rumors of Britney Spears marital issues after ditching rings

Sam Asghari addresses rumors of Britney Spears marital issues after ditching rings

March 31, 2023
Minor League Salaries Will Double Under New Deal

Minor League Salaries Will Double Under New Deal

March 31, 2023
How a Trump-Era Rollback Mattered for Silicon Valley Bank’s Demise

How a Trump-Era Rollback Mattered for Silicon Valley Bank’s Demise

March 31, 2023
Love Is Blind Season 4: Are Bliss Poureetezadi and Zack Goytowski Still Together?

Love Is Blind Season 4: Are Bliss Poureetezadi and Zack Goytowski Still Together?

March 31, 2023
Bail Law Is a Key Stumbling Block as New York’s Budget Deadline Looms

Bail Law Is a Key Stumbling Block as New York’s Budget Deadline Looms

March 31, 2023
‘Rust’ assistant director David Halls sentenced to probation

‘Rust’ assistant director David Halls sentenced to probation

March 31, 2023
The 5 Best New TV Shows Our Critic Watched in March 2023

The 5 Best New TV Shows Our Critic Watched in March 2023

March 31, 2023
Cherry Blossom Season Is Here—Celebrate It With These Japanese Snacks

Cherry Blossom Season Is Here—Celebrate It With These Japanese Snacks

March 31, 2023
GM EVs will ditch Apple CarPlay just before Apple’s huge upgrade

GM EVs will ditch Apple CarPlay just before Apple’s huge upgrade

March 31, 2023
Review: A Trip From Light to Dark With the National Ballet of Canada

Review: A Trip From Light to Dark With the National Ballet of Canada

March 31, 2023
Getty and London’s National Portrait Gallery to Jointly Buy a Masterpiece

Getty and London’s National Portrait Gallery to Jointly Buy a Masterpiece

March 31, 2023
Jack LaLanne Biopic Based On Steven Kaminsky Biography ‘Anything Is Possible’ In Works from Lisa Saltzman, Gunnar Peterson

Jack LaLanne Biopic Based On Steven Kaminsky Biography ‘Anything Is Possible’ In Works from Lisa Saltzman, Gunnar Peterson

March 31, 2023
DNYUZ
  • Home
  • News
    • U.S.
    • World
    • Politics
    • Opinion
    • Business
    • Crime
    • Education
    • Environment
    • Science
  • Entertainment
    • Culture
    • Music
    • Movie
    • Television
    • Theater
    • Gaming
    • Sports
  • Tech
    • Apps
    • Autos
    • Gear
    • Mobile
    • Startup
  • Lifestyle
    • Arts
    • Fashion
    • Food
    • Health
    • Travel
No Result
View All Result
DNYUZ
No Result
View All Result
Home News

Bringing order to data lakehouses, Onehouse is expanding its Apache Hudi technology with $25M raise

February 2, 2023
in News
Bringing order to data lakehouses, Onehouse is expanding its Apache Hudi technology with $25M raise
519
SHARES
1.5k
VIEWS
Share on FacebookShare on Twitter

Managed data lakehouse vendor Onehouse today announced that it has raised $25 million in a series A round of funding to help further advance its go-to-market and technology efforts based on the open-source Apache Hudi project.

Onehouse emerged from stealth a year ago, in Feb. 2022, as the first commercial vendor providing support and service for Apache Hudi. Hudi, which is an acronym for Hadoop Upserts Deletes and Incrementals, traces its roots back to Uber in 2016 where it was first developed as a technology to help bring order to the massive volumes of data that were being stored in data lakes.

The Hudi technology provides a data lake table format as well as services to help with clustering, archiving and data replication. Hudi competes against multiple other open-source data lake table technologies including Apache Iceberg and Databricks Delta Lake.

The goal at Onehouse is to create a cloud-managed service that can help organizations benefit from a managed data lakehouse. Alongside the new funding, Onehouse announced its Onetable initiative that aims to enable users of Iceberg and Delta Lake to interoperate with Hudi. With Onetable, organizations can use Hudi for data ingestion into a data lake while still being able to benefit from query engine technologies that run on Iceberg — including Snowflake — as well as Databricks’ Delta Lake.

“We are really trying to build a new way of thinking about data architecture,” Onehouse founder and CEO Vinoth Chandar, told VentureBeat. “We are very convinced that people should start with an interoperable lakehouse.”

Understanding the data lakehouse trend

The data lakehouse is a term first coined by Databricks. 

The goal of a data lakehouse is to take the best aspects of a data lake, which provides large volumes of data storage, with a data warehouse that provides structured data services for queries and data analytics. A 2022 report from Databricks identified a number of key benefits of the data lakehouse approach including improved data quality, increased productivity and better data collaboration.

A key component of the data lakehouse model is the ability to apply structure to data lakes, which is where the open-source data lake table formats, including Hudi, Delta Lake and Iceberg fit in. Multiple vendors are now building full platforms with those table formats as a foundation.

Among the many supporters of Apache Iceberg is Cloudera, which launched its data lakehouse service in August 2022. Dremio is another strong Iceberg supporter, using it as part of its data lakehouse platform. Even Snowflake, one of the pioneers of the cloud data warehouse concept, is now supporting Iceberg.

Onetable isn’t another data lake table format 

At the core of the major data lake formats today, including Hudi, Delta Lake and Iceberg, are files that organizations want to be able to use for analytics, business intelligence or operations.

A challenge that has emerged, though, is that vendor technologies have been increasingly vertically integrated — combining the data storage and query engines. Kyle Weller, head of product at Onehouse, explained he’s seen organizations confused about which vendor to choose based on which data lake table format approach is supported. The Onetable approach is intended to abstract away the differences across the data lake table formats, to create an interoperability layer.

“The goal and the mission of Onehouse is about decoupling data processing data query engines from how your core data infrastructure operates,” Weller told VentureBeat.

Weller added that at the foundation of many data lakes today are files stored in the Apache Parquet data storage format. What Onetable is essentially doing is providing a metadata layer on top of Parquet that enables easy translation from one table format to another.

Where Onetable fits into the data lakehouse use case

Chandar noted that Hudi provides advantages over other formats, such as transactional replication and fast data ingestion.

One potential use case where he sees the Onetable feature fitting in, is for organizations using Hudi to do massive volumes of data ingestion, but want to be able to use the data with another query engine or technology such as a Snowflake Data Cloud deployment, for some type of analytics.

Chandar said a lot of companies have data sitting in data warehouses and they are increasingly deciding to build a data lake either because of costs or because they want to start a new data science team. The first thing those organizations will do is data ingestion, bringing all their transactional data to the lake, which is where Chandar said Hudi and the Onehouse service excels.

Now with the benefit of the Onetable technology, the same organization that has ingested data into Onehouse, can also use other technologies such as Snowflake and Databricks for data queries on the data, for analytics.

Looking forward for both Hudi and the Onehouse platform, Chandar emphasized that further optimizing the ability for organizations to utilize data quickly will remain a key theme.

“We have announced in the Hudi project that we want to add a caching layer at some point,” he said. “We are thinking about anything and everything around data and how we can optimize it really well.”

The post Bringing order to data lakehouses, Onehouse is expanding its Apache Hudi technology with $25M raise appeared first on Venture Beat.

Share208Tweet130Share

Trending Posts

Fact Check: Trump Suggests Transgender ‘Drugs’ Fueled Nashville Shooter

Fact Check: Trump Suggests Transgender ‘Drugs’ Fueled Nashville Shooter

March 31, 2023
4 New Artists You Need to Hear

4 New Artists You Need to Hear

March 31, 2023
This isn’t the end of Trump – it’s his golden opportunity

This isn’t the end of Trump – it’s his golden opportunity

March 31, 2023
Ivanka Offers Extremely Low Energy Statement in Support of Dad

Ivanka Offers Extremely Low Energy Statement in Support of Dad

March 31, 2023
Trump Indictment Proves Fox News Simply Cannot Quit Its MAGA Daddy

Trump Indictment Proves Fox News Simply Cannot Quit Its MAGA Daddy

March 31, 2023

Copyright © 2023.

Site Navigation

  • About
  • Advertise
  • Privacy & Policy
  • Contact

Follow Us

No Result
View All Result
  • Home
  • News
    • U.S.
    • World
    • Politics
    • Opinion
    • Business
    • Crime
    • Education
    • Environment
    • Science
  • Entertainment
    • Culture
    • Gaming
    • Music
    • Movie
    • Sports
    • Television
    • Theater
  • Tech
    • Apps
    • Autos
    • Gear
    • Mobile
    • Startup
  • Lifestyle
    • Arts
    • Fashion
    • Food
    • Health
    • Travel

Copyright © 2023.

We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits. By clicking “Accept”, you consent to the use of ALL the cookies.
Cookie settingsACCEPT
Privacy & Cookies Policy

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these cookies, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may have an effect on your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. This category only includes cookies that ensures basic functionalities and security features of the website. These cookies do not store any personal information.
Non-necessary
Any cookies that may not be particularly necessary for the website to function and is used specifically to collect user personal data via analytics, ads, other embedded contents are termed as non-necessary cookies. It is mandatory to procure user consent prior to running these cookies on your website.
SAVE & ACCEPT