Exploring Wikipedia with Apache Spark: A Live Coding Demo
Summary
Sameer Farooqui demonstrates connecting to the live stream of Wikipedia edits and building a dashboard that shows, in real time, what is happening with Wikipedia datasets and how people are using them.
Bio
Sameer Farooqui is a Technology Evangelist at Databricks, where he focuses on enabling Spark deployments via tech support, consulting and training. Before that, Sameer was a Systems Architect at Hortonworks and an Enterprise Solutions Specialist at Symantec. He is also a regular speaker at big data conferences such as Strata + Hadoop World, Cassandra Summit and Big Data TechCon.
About the conference
Chariot Solutions is a software development consulting firm. We build and integrate the critical software applications that run our clients’ businesses. We are successful because we attract the most talented and collaborative software architects in the region. They are leaders in Java, open source and emerging technologies. We work in small, agile teams. We solve hard problems with a practical approach centered on communication, common sense and continual learning. We believe it is important to give back to our community through shared learning.