Your submission was sent successfully! Close

You have successfully unsubscribed! Close

Thank you for signing up for our newsletter!
In these regular emails you will find the latest updates about Ubuntu and upcoming events where you can meet our team.Close

Apache Bigtop and Juju: a charming approach to big data

This article was last updated 7 years ago.


Juju users have been enjoying our collection of Big Data charms for over two years. During this time, we’ve learned a lot about what our users want from this complex corner of big software.

The Apache Bigtop community distills best practices for installing big data software. They extensively test and package Apache big data projects to ensure users are able to manage their deployments. This community also provides a foundation for other projects and products to build from.

Juju charms distill the operational intelligence needed to run big software such as Openstack, Kubernetes, and the Bigtop stack across clouds and architectures. Charms provide an Open Source service-oriented approach to sharing how complex software should be modeled. By modeling Bigtop with charms, we can rapidly deploy and test complex solutions at scale (and across clouds). These tests uncover hard problems, which leads to community collaboration, which leads to making individual Bigtop components better. Bigtop charms allow operators and developers to use, test, and benchmark the Bigtop stack on their laptops, bare metal, or favorite cloud.

Every time a Bigtop charm gets better, Bigtop also gets better – and vice versa. Since Juju can repeatedly deploy charms on multiple clouds and architectures, it allows us to quickly identify issues in individual components as well the relationships to other components in the Bigtop stack.

Today we are thrilled to announce the Bigtop charms, bundles, and test plans live alongside the Bigtop source!

The source layers for the charms for the Apache Hadoop component of Bigtop, along with instructions for building the charms from these layers, can now be found in the Apache Bigtop repository at: https://github.com/apache/bigtop/tree/master/bigtop-packages/src/charm

The Bigtop repository also includes the hadoop-processing bundle, which encapsulates how to deploy and relate these charms to get a fully-functioning, scalable Hadoop cluster in minutes: https://github.com/apache/bigtop/tree/master/bigtop-deploy/juju/hadoop-processing

Additionally, we included a test plan for the Cloud Weather Report project to run the Bigtop solutions across multiple clouds and report back testing and benchmarking results: https://github.com/apache/bigtop/tree/master/bigtop-tests/cloud-weather-report

Jump in and submit a pull request for any of those charms or bundles. If there is a specific workload that you would like to share with others, please mail the Bigtop Dev List. To see current ideas, take a look at our Apache Bigtop Charming Effort community wiki and/or chime in on the in-progress JIRA bugs: https://goo.gl/Hhda2M

If you already have Juju 2 installed, give the latest Bigtop Hadoop bundle a try with:

juju deploy hadoop-processing

Or deploy the bundle using the card below:

View details

hadoop-processing

by bigdata-charmers

If you’re new to Juju, follow the getting started instructions to start developing on these Bigtop solutions. If you need help with that, we’d be happy to walk you through the process on the Juju mailing list.

For those of you looking for some face to face training, join us at the next Juju Charmer Summit September 12th-14th, 2016 in Pasadena, CA for charm development, design, and best practices. It’s free for anyone to attend.

The Bigtop community is vibrant, collaborative, and friendly — a community we are excited to be a part of to help make Apache big data software better!

Ubuntu cloud

Ubuntu offers all the training, software infrastructure, tools, services and support you need for your public and private clouds.

Newsletter signup

Get the latest Ubuntu news and updates in your inbox.

By submitting this form, I confirm that I have read and agree to Canonical's Privacy Policy.

Related posts

What is a Kubernetes operator?

Kubernetes is the open source, industry-standard platform for deploying, managing and scaling containerized applications – and applications on Kubernetes are...

Operate popular open source on Kubernetes – Attend Operator Day at KubeCon EU 2024

Operate popular open source on Kubernetes – Attend Operator Day at KubeCon EU 2024

Understanding roles in software operators

In today’s blog we take a closer look at roles – the key elements that make up the design pattern – and how they work together to simplify maintaining...