Jump to content
Wikipedia The Free Encyclopedia

Databricks

From Wikipedia, the free encyclopedia
American software company
Databricks, Inc.
Company typePrivate
IndustryComputer software
Founded2013; 12 years ago (2013)[1]
Founders
Headquarters,
United States
Key people
  • Ali Ghodsi
  • (CEO)
  • Ion Stoica
  • (Executive chairman)
RevenueIncrease 1ドル.6 billion (2023)[2]
Number of employees
c. 8,000 (2025)[3]
Websitedatabricks.com

Databricks, Inc. is a global data, analytics, and artificial intelligence (AI) company, founded in 2013 by the original creators of Apache Spark.[1] [4] The company provides a cloud-based platform to help enterprises build, scale, and govern data and AI, including generative AI and other machine learning models.[5]

Databricks pioneered the data lakehouse, a data and AI platform that combines the capabilities of a data warehouse with a data lake, allowing organizations to manage and use both structured and unstructured data for traditional business analytics and AI workloads.[6] The company similarly develops Delta Lake, an open-source project to bring reliability to data lakes for machine learning and other data science use cases.[7]

History

[edit ]
Databricks booth (2023)

Databricks grew out of the AMPLab project at University of California, Berkeley that was involved in making Apache Spark, an open-source distributed computing framework built atop Scala. The company was founded by Ali Ghodsi, Andy Konwinski, Arsalan Tavakoli-Shiraji, Ion Stoica, Matei Zaharia,[8] Patrick Wendell, and Reynold Xin.[citation needed ]

In November 2017, the company was announced as a first-party service on Microsoft Azure via integration Azure Databricks.[9]

In February 2021, together with Google Cloud, Databricks provided integration with the Google Kubernetes Engine and Google's BigQuery platform.[10] At this point in time, the company said more than 5,000 organizations used its products.[11]

Fortune ranked Databricks as one of the "Best Large Workplaces for Millennials" in 2021.[12]

In November 2023, Databricks unveiled the Databricks Data Intelligence Platform, a new offering that combines the unification benefits of the lakehouse with MosaicML’s Generative AI technology to enable customers to better understand and use their own proprietary data.[13]

The firm was valued at 62ドル billion in December 2024,[14] following a funding round that raised one of the largest amounts in history, an equivalent to the largest single AI investment ever made.[15]

Acquisitions

[edit ]

In June 2020, Databricks bought Redash, an open-source tool for data visualization and building of interactive dashboards.[16] In 2021, it bought German no-code company 8080 Labs whose product, bamboolib, allowed data exploration without any coding.[17] In May 2023, Databricks bought data security group Okera, extending Databricks data governance capabilities.[18] In June, it bought the open-source generative AI startup MosaicML for 1ドル.4 billion.[19] [20] In October, Databricks bought data replication startup Arcion for 100ドル million.[21] In what is believed to be its sixth acquisition, Databricks bought Tabular, a data-management system used by open source AI, for over 1ドル billion.[22]

In March 2023, in response to the popularity of OpenAI's ChatGPT, the company introduced an open-source language model, named Dolly after Dolly the sheep, that allowed developers to create chatbots. Dolly uses fewer parameters to produce similar results as ChatGPT, but Databricks had not released formal benchmark tests to show whether its bot actually matched the performance of ChatGPT.[23] [24] [25]

Databricks reported 1ドル.6 billion in revenue for the 2023 fiscal year, more than doubling its previous level.[26]

Funding

[edit ]

In September 2013, Databricks announced it raised 13ドル.9 million from Andreessen Horowitz and said it aimed to offer an alternative to Google's MapReduce system.[27] [28] Microsoft was a noted investor of Databricks in 2019, participating in the company's Series E at an unspecified amount.[29] [30] The company has raised 1ドル.9 billion in funding, including a 1ドル billion Series G led by Franklin Templeton at a 28ドル billion post-money valuation in February 2021. Other investors include Amazon Web Services, CapitalG (a growth equity firm under Alphabet Inc.) and Salesforce Ventures.[11] In August 2021, Databricks finished its eighth round of funding by raising 1ドル.6 billion and valuing the company at 38ドル billion.[31] In December 2024, Databricks announced a 10ドル billion financing at a valuation of 62ドル billion.[14]

Funding rounds
Series Date Amount (million $) Lead investors
A 2013 13.9[27] Andreessen Horowitz
B 2014 33[32] New Enterprise Associates
C 2016 60[33] New Enterprise Associates
D 2017 140[34] Andreessen Horowitz
E Feb. 2019 250[35] Andreessen Horowitz
F Oct. 2019 400[36] Andreessen Horowitz
G Jan. 2021 1,000[37] Franklin Templeton Investments
H Aug. 2021 1,600[38] Morgan Stanley
I Sep. 2023 500[39] Capital One Ventures, Nvidia
J Dec. 2024 10,000[40] Thrive Capital

Products

[edit ]

Databricks develops and sells a cloud data platform using the marketing term "lakehouse", a portmanteau of "data warehouse" and "data lake".[41] Databricks' Lakehouse is based on the open-source Apache Spark framework that allows analytical queries against semi-structured data without a traditional database schema.[42] In October 2022, Lakehouse received FedRAMP authorized status for use with the U.S. federal government and contractors.[43]

The company has also created Delta Lake, MLflow and Koalas, open source projects that span data engineering, data science and machine learning.[44] [45]

In June 2020, Databricks launched Delta Engine, a fast query engine for Delta Lake,[46] compatible with Apache Spark and MLflow.[47]

In November 2020, Databricks introduced Databricks SQL (previously called SQL Analytics) for running business intelligence and analytics reporting on top of data lakes. Analysts can query data sets with standard SQL or use connectors to integrate with business intelligence tools like Holistics,[48] Tableau, Qlik, SigmaComputing,[49] Looker, and ThoughtSpot.[50]

Databricks offers a platform for other workloads, including machine learning, data storage and processing, streaming analytics, and business intelligence.[51]

In early 2024, Databricks released the Mosaic set of tools for customizing, fine-tuning and building AI systems. It includes AI Vector Search for building RAG models; AI Model Serving, a service for deploying, governing, querying and monitoring models fine-tuned or pre-deployed by Databricks; and AI Pretraining, a platform for enterprises to create their own LLMs.[52]

In March 2024, Databricks released DBRX, an open-source foundation model. It has a mixture-of-experts architecture and is built on the MegaBlocks open-source project.[53] DBRX cost 10ドル million to create. At the time of launch, it was the fastest open-source LLM, based on commonly-used industry benchmarks. It beat other models like Llama 2 at solving logic puzzles and answering general knowledge questions, among other tasks. And while it has 136 billion parameters, it only uses 36 billion, on average, to generate outputs.[54] DBRX also serves as a foundation for companies to build or customize their own AI models. Companies can also use proprietary data to generate higher-quality outputs for specific use cases.[55]

In addition to building the Databricks platform, the company has co-organized massive open online courses about Spark[56] and a conference for the Spark community called the Data + AI Summit,[57] formerly known as Spark Summit.[58]

Collaborations

[edit ]

In December 2024, Databricks along with Wiz and Workday has decided to run their products on top of AWS via the new button called "Buy with AWS button".[59]

Operations

[edit ]

Databricks is headquartered in San Francisco.[60] It also has operations in Canada, the United Kingdom, and elsewhere.[61]

References

[edit ]
  1. ^ a b Krystal Hu; Kenrick Cai; Echo Wang (December 13, 2024). "Exclusive: Databricks nears record 9ドル.5 billion VC raise, eyes extra 4ドル.5 billion debt". Reuters . Retrieved December 13, 2024.
  2. ^ Lin, Belle (March 6, 2024). "AI is Driving Record Sales at Multibillion-Dollar Databricks. An IPO Can Wait ..." . The Wall Street Journal . Archived from the original on March 6, 2024.
  3. ^ Jordan, Novet (January 22, 2025). "Meta backs Databricks as the data analytics startup inches toward IPO". CNBC. Archived from the original on January 22, 2025.
  4. ^ Saul, Derek (September 14, 2023). "Top IPO Prospect Databricks Scores 43ドル Billion Valuation Thanks To 500ドル Million Funding Round Including AI Titan Nvidia". Forbes. Archived from the original on September 4, 2024. Retrieved March 26, 2024.
  5. ^ Sullivan, Mark (March 19, 2024). "How Databricks is helping customers develop their own customized AI models". Fast Company. Retrieved March 19, 2024.
  6. ^ Clark, Lindsay (November 16, 2023). "Databricks' lakehouse becomes foundation under fresh layer of AI dreams". The Register. Archived from the original on September 4, 2024. Retrieved November 16, 2023.
  7. ^ "Databricks launches Delta Lake, an open source data lake reliability project". VentureBeat. April 24, 2019. Archived from the original on March 24, 2022. Retrieved April 6, 2021.
  8. ^ Zaharia, Matei. "Matei Zaharia". Archived from the original on March 10, 2014. Retrieved August 16, 2016.
  9. ^ "Microsoft makes Databricks a first-party service on Azure". TechCrunch. November 15, 2017. Archived from the original on September 4, 2024. Retrieved April 6, 2021.
  10. ^ "Databricks brings its lakehouse to Google Cloud". TechCrunch. February 17, 2021. Archived from the original on September 4, 2024. Retrieved February 18, 2021.
  11. ^ a b Konrad, Alex (February 2, 2021). "Databricks Raises 1ドル Billion At 28ドル Billion Valuation, With The Cloud's Elite All Buying In". Forbes. Archived from the original on February 1, 2021. Retrieved July 29, 2021.
  12. ^ "100 Best Large Workplaces for Millennials". Fortune. June 16, 2021. Archived from the original on March 24, 2022. Retrieved July 16, 2021.
  13. ^ Cai, Kenrick (November 16, 2023). "Databricks' New AI Product Adds A ChatGPT-Like Interface To Its Software". Forbes. Archived from the original on September 4, 2024. Retrieved November 16, 2023.
  14. ^ a b "Databricks Is Raising 10ドル Billion, in One of the Largest Venture Capital Deals". December 17, 2024. Archived from the original on December 18, 2024. Retrieved December 19, 2024.
  15. ^ Why AI company Databricks just scored one of the biggest funding rounds in history Fast Company. December 17, 2025
  16. ^ "Databricks acquires Redash, a visualizations service for data scientists". TechCrunch. June 24, 2020. Retrieved April 6, 2021.
  17. ^ Eric Rosenbaum (October 6, 2021). "38ドル billion software start-up Databricks makes acquisition to leave code behind". CNBC. Archived from the original on October 6, 2021. Retrieved February 20, 2022.
  18. ^ Palazzolo, Stephanie (May 3, 2023). "Exclusive: 38ドル billion data and AI darling Databricks acquires security startup Okera" . Business Insider . Archived from the original on May 3, 2023.
  19. ^ Datta, Tiyashi; Hu, Krystal (June 26, 2023). "Databricks strikes 1ドル.3 billion deal for generative AI startup MosaicML". Reuters. Archived from the original on June 26, 2023. Retrieved June 27, 2023.
  20. ^ Council, Stephen (June 26, 2023). "SF tech firm Databricks to buy 2-year-old startup for 21ドル million per employee". SFGATE . Archived from the original on June 26, 2023. Retrieved June 27, 2023.
  21. ^ "After 43ドルB valuation, Databricks acquires data replication startup Arcion for 100ドルM". TechCrunch. October 23, 2023. Retrieved October 23, 2023.
  22. ^ Galloni, Allessandra, ed. (June 5, 2024). "Databricks to buy data management firm Tabular for over 1ドル bln". Reuters .
  23. ^ Hu, Krystal; Nellis, Stephen (March 24, 2023). "Databricks pushes open-source chatbot as cheaper ChatGPT alternative". Reuters. Archived from the original on March 25, 2023.
  24. ^ Loften, Angus (March 24, 2023). "Databricks Launches 'Dolly,' Another ChatGPT Rival" . The Wall Street Journal . Archived from the original on March 24, 2023.
  25. ^ Goldman, Sharon (March 24, 2023). "Databricks debuts ChatGPT-like Dolly, a clone any enterprise can own". VentureBeat . Archived from the original on April 11, 2023.
  26. ^ Wilhelm, Ron Miller and Alex (March 7, 2024). "Databricks keeps marching forward with 1ドル.6B in revenue". TechCrunch. Archived from the original on March 12, 2024. Retrieved March 8, 2024.
  27. ^ a b Harris, Derrick (September 25, 2013). "Databricks raises 14ドルM from Andreessen Horowitz, wants to take on MapReduce with Spark". Archived from the original on January 15, 2022. Retrieved September 28, 2014.
  28. ^ Lorica, Ben (September 25, 2013). "Databricks aims to build next-generation analytic tools for Big Data". O'Reilly Media. Archived from the original on July 4, 2014. Retrieved September 28, 2014.
  29. ^ "Databricks raises 250ドルM at a 2ドル.75B valuation for its analytics platform". TechCrunch. February 5, 2019. Archived from the original on September 4, 2024. Retrieved April 8, 2021.
  30. ^ Novet, Jordan (February 5, 2019). "Microsoft used to scare start-ups but is now an 'outstandingly good partner,' says Silicon Valley investor Ben Horowitz". CNBC. Archived from the original on February 5, 2019. Retrieved April 6, 2021.
  31. ^ Mellor, Chris (September 1, 2021). "Databricks raises data lake of cash at monstrous 380ドルbn valuation". Blocks & Files. Archived from the original on September 1, 2021. Retrieved September 4, 2021.
  32. ^ Miller, Ron (June 30, 2014). "Databricks Snags 33ドルM In Series B And Debuts Cloud Platform For Processing Big Data". TechCrunch. Archived from the original on July 1, 2014. Retrieved September 28, 2014.
  33. ^ Shieber, Jonathan (December 15, 2016). "Databricks raises 60ドル million to be big data's next great leap forward". TechCrunch. Archived from the original on December 15, 2016. Retrieved December 16, 2016.
  34. ^ "Databricks Secures 140ドル Million to Accelerate Analytics and Artificial Intelligence in the Enterprise". Databricks. August 22, 2017. Archived from the original on January 13, 2022. Retrieved May 16, 2019.
  35. ^ "Databricks' 250ドル Million Funding Supports Explosive Growth and Global Demand for Unified Analytics; Brings Valuation to 2ドル.75 Billion". Databricks. February 5, 2019. Archived from the original on January 15, 2022. Retrieved February 5, 2019.
  36. ^ "Databricks announces 400ドルM round on 6ドル.2B valuation as analytics platform continues to grow". TechCrunch. October 22, 2019. Archived from the original on September 4, 2024. Retrieved October 24, 2019.
  37. ^ "Databricks raises 1ドルB at 28ドルB valuation as it reaches 425ドルM ARR". Tech Crunch. February 2021. Archived from the original on November 3, 2021. Retrieved February 14, 2021.
  38. ^ "Databricks raises 1ドル.6B at 38ドルB valuation as it blasts past 600ドルM ARR". Tech Crunch. Archived from the original on December 30, 2021. Retrieved July 1, 2021.
  39. ^ Nishant, Niket; Hu, Krystal (September 14, 2023). "Databricks raises over 500ドル mln at 43ドル bln valuation". Reuters. Retrieved September 20, 2023.
  40. ^ Tan, Huileng (December 18, 2024). "Databricks is raising a gigantic funding round". Business Insider.
  41. ^ Michael, Armbrust; Ghodsi, Ali; Xin, Reynold; Zaharia, Matei (January 2021). "Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics" (PDF). Conference on Innovative Data Systems Research . Archived (PDF) from the original on December 22, 2020. Retrieved July 29, 2021.
  42. ^ "With massive 1ドルB infusion, Databricks takes aim at IPO and rival Snowflake". SiliconANGLE. February 1, 2021. Archived from the original on April 6, 2023. Retrieved April 8, 2021.
  43. ^ Simone, Stephanie (October 17, 2022). "Databricks achieves FedRAMP Authorized status". KMWorld. Information Today. Archived from the original on October 20, 2022. Retrieved October 20, 2022.
  44. ^ "The Two Sigma Ventures Open Source Index". Two Sigma Ventures. Archived from the original on November 29, 2022. Retrieved April 8, 2021.
  45. ^ "MLOps Tools - Ranking. OSS Insight". OSS Insight. Archived from the original on September 4, 2024. Retrieved April 3, 2024.
  46. ^ "Databricks Cranks Delta Lake Performance, Nabs Redash for SQL Viz". Datanami. June 24, 2020. Archived from the original on July 9, 2020. Retrieved April 8, 2021.
  47. ^ "Databricks launches Delta Lake, an open source data lake reliability project". VentureBeat. April 24, 2019. Archived from the original on March 24, 2022. Retrieved April 8, 2021.
  48. ^ https://holistics.io/
  49. ^ https://www.sigmacomputing.com
  50. ^ "Databricks launches SQL Analytics". TechCrunch. November 12, 2020. Archived from the original on September 4, 2024. Retrieved April 8, 2021.
  51. ^ Brust, Andrew. "Databricks, champion of data "lakehouse" model, closes 1ドルB series G funding round". ZDNet. Archived from the original on February 1, 2021. Retrieved April 8, 2021.
  52. ^ "Riding the data-powered AI wave: Inside Databricks' unified stack solution". Databricks. March 14, 2024. Archived from the original on September 4, 2024. Retrieved April 5, 2024.
  53. ^ "Databricks open-sources its own large language model, DBRX". Databricks. March 27, 2024. Archived from the original on April 5, 2024. Retrieved April 5, 2024.
  54. ^ "Inside the Creation of the World's Most Powerful Open Source AI Model". Databricks. March 27, 2024. Archived from the original on September 4, 2024. Retrieved April 5, 2024.
  55. ^ "Databricks' new open-source AI model could offer enterprises a leaner alternative to OpenAI's GPT-3.5". Databricks. March 27, 2024. Archived from the original on September 4, 2024. Retrieved April 5, 2024.
  56. ^ "Databricks to run two massive online courses on Apache Spark". Databricks. December 2, 2014. Archived from the original on January 13, 2022. Retrieved December 16, 2016.
  57. ^ "Data + AI Summit". Databricks. Archived from the original on April 23, 2022. Retrieved April 8, 2021.
  58. ^ Highlights from DATA+AI Summit 2021 Towards Data Science. June 27, 2021
  59. ^ Novet, Jordan (December 4, 2024). "Amazon rolls out Buy with AWS button to let software vendors more easily sell to its cloud customers". CNBC. Retrieved December 8, 2024.
  60. ^ staff, CNBC com (June 16, 2020). "36. Databricks". CNBC. Archived from the original on December 24, 2022. Retrieved April 8, 2021.
  61. ^ "Worldwide locations". Archived from the original on June 7, 2023. Retrieved October 20, 2022.

AltStyle によって変換されたページ (->オリジナル) /