• Latest
  • Trending
  • All
  • News
  • Business
  • Politics
  • Science
  • World
  • Lifestyle
  • Tech
Databricks debuts ChatGPT-like Dolly, a clone any enterprise can own

Databricks debuts ChatGPT-like Dolly, a clone any enterprise can own

March 24, 2023
With Migrant Flights, DeSantis Shows Stoking Outrage Is the Point

With Migrant Flights, DeSantis Shows Stoking Outrage Is the Point

June 7, 2023
Smoke Leads to Cancellations of ‘Hamilton’ on Broadway and ‘Hamlet’ in Central Park

Smoke Leads to Cancellations of ‘Hamilton’ and ‘Camelot’ on Broadway and ‘Hamlet’ in Central Park

June 7, 2023
In Japan, embarrassed employees pay agencies to quit for them

In Japan, embarrassed employees pay agencies to quit for them

June 7, 2023
Massachusetts sober home operator pleads not guilty in COVID relief fraud scheme

Massachusetts sober home operator pleads not guilty in COVID relief fraud scheme

June 7, 2023
The Republican Silly Season Has Begun

The Republican Silly Season Has Begun

June 7, 2023
New York City sues 30 counties over ‘xenophobic’ orders banning migrant relocations

New York City sues 30 counties over ‘xenophobic’ orders banning migrant relocations

June 7, 2023
DeSantis defends flying migrants to California as he meets with sheriffs near border

DeSantis defends flying migrants to California as he meets with sheriffs near border

June 7, 2023
Orange Skies and Burning Eyes as Smoke Shrouds New York City

Orange Skies and Burning Eyes as Smoke Shrouds New York City

June 7, 2023
Meadows’ Attorney Denies Making Trump Probe Immunity Deal: ‘Complete Bullshit’

Meadows’ Attorney Denies Making Trump Probe Immunity Deal: ‘Complete Bullshit’

June 7, 2023
Putin’s Loudest Crony Full-On Panics Over Shelling Inside Russia

Putin’s Loudest Crony Full-On Panics Over Shelling Inside Russia

June 7, 2023
Loonie Set to Extend Rally With Bank of Canada Seen Raising Rates

Loonie Set to Extend Rally With Bank of Canada Seen Raising Rates

June 7, 2023
Remembering a Massacre That China Keeps Trying to Erase

Remembering a Massacre That China Keeps Trying to Erase

June 7, 2023
DNYUZ
  • Home
  • News
    • U.S.
    • World
    • Politics
    • Opinion
    • Business
    • Crime
    • Education
    • Environment
    • Science
  • Entertainment
    • Culture
    • Music
    • Movie
    • Television
    • Theater
    • Gaming
    • Sports
  • Tech
    • Apps
    • Autos
    • Gear
    • Mobile
    • Startup
  • Lifestyle
    • Arts
    • Fashion
    • Food
    • Health
    • Travel
No Result
View All Result
DNYUZ
No Result
View All Result
Home News

Databricks debuts ChatGPT-like Dolly, a clone any enterprise can own

March 24, 2023
in News
Databricks debuts ChatGPT-like Dolly, a clone any enterprise can own
646
SHARES
1.8k
VIEWS
Share on FacebookShare on Twitter

Was data lakehouse platform Databricks becoming an OpenAI rival on anyone’s 2023 bingo card? Well, hello, Dolly.

Today, in an effort the company says is meant to build on their longtime mission to democratize AI for the enterprise, Databricks released the code for an open-source large language model (LLM) called Dolly — named after Dolly the sheep, the first cloned mammal — that it said companies can use to create instruction-following chatbots similar to ChatGPT.

The model can be trained, the company explained in a blog post, on very little data and in very little time. “With 30 bucks, one server and three hours, we’re able to teach [Dolly] to start doing human-level interactivity,” said Databricks CEO Ali Ghodsi.

There are many reasons a company would prefer to build their own LLM model rather than sending data to a centralized LLM provider that serves a proprietary model behind an API, the blog post explained. Handing sensitive data over to a third party may not be an option, while organizations may have specific needs as far as model quality, cost and desired behavior.

“We believe that most ML users are best served long term by directly owning their models,” said the blog post.

Databricks found ChatGPT-like qualities don’t require latest or largest LLM

According the blog post, Databricks said Dolly is meant to show that anyone “can take a dated off-the-shelf open source large language model and give it magical ChatGPT-like instruction.” Surprisingly, it said, instruction-following does not seem to require the latest or largest models — Dolly is only 6 billion parameters, compared to 175 billion for GPT-3.

“We’ve been calling ourselves a data and AI company since 2013, and we have close to 1000 customers that have been using some kind of large language model on Databricks,” said Ghodsi, who told VentureBeat he was “blown away” when ChatGPT was launched at the end of November 2022, but realized only a few companies on the planet have the massive language models necessary for ChatGPT-level ability.

“Most people were thinking, do we have to all leverage these proprietary models that these very few companies have? And if so, do we have to give them our data?” he said.

The answer to both of those questions is no: In February, Meta released the weights for a set of high-quality (but not instruction-following) language models called LLaMA to academic researchers, trained for over 80,000 GPU-hours each. Then, in March, Stanford built the Alpaca model, which was based on LLaMA, but tuned on a small dataset of 50,000 human-like questions and answers that, surprisingly, made it exhibit ChatGPT-like interactivity.

Inspired by those two options, Databricks was able to take an existing open source 6 billion parameter model from EleutherAI and slightly modify it to elicit instruction following capabilities such as brainstorming and text generation not present in the original model, using data from Alpaca.

Surprisingly, the modified model worked very well. According to the blog post, this suggests that “much of the qualitative gains in state-of-the-art models like ChatGPT may owe to focused corpuses of instruction-following training data, rather than larger or better-tuned base models.”

LLM models will not be the hands of only a few companies

Ghodi said that going forward there will many more LLM models that will become cheaper and cheaper — and won’t be in the hands of only a few companies.

“Every organization on the planet will probably utilize these,” he said. “Our belief is that in every industry, the winning, leading companies will be data and AI companies that will be leveraging this kind of technology and will have these kinds of models.”

The post Databricks debuts ChatGPT-like Dolly, a clone any enterprise can own appeared first on Venture Beat.

Share258Tweet162Share

Trending Posts

Why Prince Harry Is Litigating the Past in His High Court Testimony

Why Prince Harry Is Litigating the Past in His High Court Testimony

June 7, 2023
Shannon Beador on what led to John Janssen split: ‘Never going to get back together’ 

Shannon Beador on what led to John Janssen split: ‘Never going to get back together’ 

June 7, 2023
Smoke Leads to Cancellations of ‘Hamilton’ on Broadway and ‘Hamlet’ in Central Park

Smoke Leads to Cancellations of ‘Hamilton’ on Broadway and ‘Hamlet’ in Central Park

June 7, 2023
PGA-LIV golf deal sparks fury of 9/11 families, human rights group

PGA-LIV golf deal sparks fury of 9/11 families, human rights group

June 7, 2023
Fox News Claims Tucker Carlson Breached His Contract With Twitter Show

Fox News Claims Tucker Carlson Breached His Contract With Twitter Show

June 7, 2023

Copyright © 2023.

Site Navigation

  • About
  • Advertise
  • Privacy & Policy
  • Contact

Follow Us

No Result
View All Result
  • Home
  • News
    • U.S.
    • World
    • Politics
    • Opinion
    • Business
    • Crime
    • Education
    • Environment
    • Science
  • Entertainment
    • Culture
    • Gaming
    • Music
    • Movie
    • Sports
    • Television
    • Theater
  • Tech
    • Apps
    • Autos
    • Gear
    • Mobile
    • Startup
  • Lifestyle
    • Arts
    • Fashion
    • Food
    • Health
    • Travel

Copyright © 2023.

We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits. By clicking “Accept”, you consent to the use of ALL the cookies.
Cookie settingsACCEPT
Privacy & Cookies Policy

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these cookies, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may have an effect on your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. This category only includes cookies that ensures basic functionalities and security features of the website. These cookies do not store any personal information.
Non-necessary
Any cookies that may not be particularly necessary for the website to function and is used specifically to collect user personal data via analytics, ads, other embedded contents are termed as non-necessary cookies. It is mandatory to procure user consent prior to running these cookies on your website.
SAVE & ACCEPT