Lighthouse is an open-source project to build and manage data lakes. Lighthouse is built on top of Scala and Spark. It is being developed by Data Minded and it is based on our experience building data lakes at large organisations.

You can find the source code for Lighthouse at GitHub: http://github.com/datamindedbe/lighthouse