We ❤️ Open Source

A community education resource

How strong data governance enables AI and machine learning growth

Why data governance is key to scaling innovation in open source and beyond.

Lauren Maffeo, an experienced open source contributor, sat down with the All Things Open team to share her insights on data governance and its crucial role in driving AI growth and innovation. Drawing on more than six years of involvement in open source, particularly in the Drupal ecosystem and data communities, she explains that data governance is often mistaken for a compliance tool when it is in fact a key enabler of innovation.

Lauren compares effective data governance to the “bones” of a house: invisible, but essential for building a strong, scalable data infrastructure. She argues that the lack of solid data governance has held back the growth of open data, in contrast to the rapid advances seen in open source code.

In addition to discussing the importance of data governance, Lauren talks about her book, a quick guide designed to help senior leaders manage data effectively. Born from conversations at the All Things Open (ATO) conference in 2019, the book aims to provide actionable steps for tackling data management challenges, from tagging to system integration. Lauren also emphasizes the need to think holistically about data governance, focusing on creating value through data products and using open source tools such as Apache Kafka for streaming to enhance data environments.

Finally, Lauren encourages open source communities to adopt collaborative practices in data governance, much like they have in software development. By bringing people together to share solutions, open source principles can help manage and govern data at scale, enabling the creation of innovative, shared data products.

Key takeaways

  • Data governance drives innovation: Effective governance lays the groundwork for advanced data uses, like AI and machine learning, by ensuring quality and accessibility.
  • Leverage open source practices for data: Open source communities are well-equipped to manage data collaboratively, driving innovation and creating shared solutions.
  • Focus on value and holistic thinking: Turn your data into valuable, reusable products that can enhance business outcomes and enable experimentation.

Conclusion

Lauren’s insights highlight the importance of data governance in unlocking the full potential of data. By focusing on the right tools, processes, and people, organizations can lay the foundation for AI, machine learning, and other data-driven innovations. Drawing on open source principles, communities can collaborate to create valuable, shared data products. The All Things Open conference provides a unique opportunity to connect with like-minded individuals, transforming ideas into actionable solutions.

About the Author

The ATO Team is a small but skilled team of talented professionals, bringing you the best open source content possible.
