Installation
This guide walks you through installing Lucille and setting up your development environment.
Prerequisites
Before installing Lucille, make sure you have the following prerequisites installed:
Required Software
Java 17+
Lucille requires Java Development Kit (JDK) version 17 or later
Maven 3.6+
Apache Maven is required to build Lucille from source
Verify Prerequisites
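Check that you have the required software installed by running java -version and mvn -version. Each should report a version at or above the minimum listed above.
Installing Java 17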
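Before installing anything, it can help to check whether an existing JDK already meets the requirement. The sketch below parses the first line of java -version output; java_major is a hypothetical helper name, not part of Lucille, and it assumes the usual version "x.y.z" output format.

```shell
#!/bin/sh
# Hypothetical helper (not part of Lucille): print the Java major version
# given the first line of `java -version` output.
java_major() {
  # Extract the quoted version string, e.g. 17.0.9 or 1.8.0_392
  jm_ver=$(printf '%s\n' "$1" | sed -n 's/.*version "\([^"]*\)".*/\1/p')
  # Releases before Java 9 use the 1.x scheme (1.8.0_392 means Java 8)
  case "$jm_ver" in
    1.*) jm_ver=${jm_ver#1.} ;;
  esac
  # Keep only the digits before the first separator
  printf '%s\n' "${jm_ver%%[._-]*}"
}

# `java -version` writes to stderr, so merge it into stdout before parsing
if command -v java >/dev/null 2>&1; then
  major=$(java_major "$(java -version 2>&1 | head -n 1)")
  if [ "${major:-0}" -ge 17 ]; then
    echo "Java $major found: OK"
  else
    echo "Java ${major:-unknown} found: Lucille needs 17 or later"
  fi
else
  echo "java not found on PATH"
fi
```

If the script reports an older or unknown version, continue with the platform instructions in this section.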
If you don’t have Java 17 installed, follow the instructions for your operating system:
macOS
Install JDK 17 using Homebrew, then set JAVA_HOME to point to it. Add the JAVA_HOME setting to your ~/.zshrc or ~/.bash_profile to make it permanent.
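The macOS steps (install with Homebrew, set JAVA_HOME, persist it) might look like the sketch below. It assumes the Homebrew formula openjdk@17 and Homebrew's usual layout for that formula; install_jdk17_macos and persist_line are hypothetical helper names, and nothing is installed until you call them.

```shell
#!/bin/sh
# Sketch for macOS with Homebrew. Nothing is installed until you call
# install_jdk17_macos; BREW can be overridden (e.g. BREW=echo) for a dry run.
BREW=${BREW:-brew}

install_jdk17_macos() {
  "$BREW" install openjdk@17
}

# Hypothetical helper: append a line to a shell startup file only if it is
# not already there, so repeated runs do not duplicate it.
persist_line() {
  pl_file=$1
  pl_line=$2
  touch "$pl_file"
  grep -qxF -- "$pl_line" "$pl_file" || printf '%s\n' "$pl_line" >> "$pl_file"
}

# Example usage (uncomment to run; the JDK path assumes Homebrew's layout):
# install_jdk17_macos
# export JAVA_HOME="$(brew --prefix openjdk@17)/libexec/openjdk.jdk/Contents/Home"
# persist_line "$HOME/.zshrc" "export JAVA_HOME=\"$JAVA_HOME\""
```

Use ~/.bash_profile instead of ~/.zshrc if your login shell is bash.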
Linux (Ubuntu/Debian)
Install the OpenJDK 17 package with your package manager, then set JAVA_HOME. Add the JAVA_HOME setting to your ~/.bashrc to make it permanent.
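On Ubuntu/Debian, the install and JAVA_HOME steps could be sketched as follows. The package name openjdk-17-jdk is the standard one in current releases; install_jdk17_debian and java_home_from_bin are hypothetical helper names, and the install only runs when you call it.

```shell
#!/bin/sh
# Sketch for Ubuntu/Debian. Nothing is installed until you call
# install_jdk17_debian; APT can be overridden (e.g. APT=echo) for a dry run.
APT=${APT:-sudo apt-get}

install_jdk17_debian() {
  # $APT is left unquoted on purpose so "sudo apt-get" splits into two words
  $APT update && $APT install -y openjdk-17-jdk
}

# Hypothetical helper: derive JAVA_HOME from the real path of a java binary,
# e.g. /usr/lib/jvm/java-17-openjdk-amd64/bin/java
#   -> /usr/lib/jvm/java-17-openjdk-amd64
java_home_from_bin() {
  dirname "$(dirname "$1")"
}

# Example usage (uncomment to run):
# install_jdk17_debian
# export JAVA_HOME="$(java_home_from_bin "$(readlink -f "$(command -v java)")")"
# echo "export JAVA_HOME=\"$JAVA_HOME\"" >> ~/.bashrc
```

Resolving the symlink with readlink -f matters because /usr/bin/java is normally a link managed by the alternatives system rather than the JDK directory itself.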
Linux (RHEL/CentOS)
Install the OpenJDK 17 package with your package manager, then set JAVA_HOME.
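For RHEL/CentOS, a comparable sketch is shown below. It assumes dnf and the java-17-openjdk-devel package (the -devel package carries the compiler that Maven needs); the JVM path in the usage comment is an assumption to verify under /usr/lib/jvm, and both function names are hypothetical.

```shell
#!/bin/sh
# Sketch for RHEL/CentOS. Nothing is installed until you call
# install_jdk17_rhel; PKG can be overridden (e.g. PKG=echo) for a dry run.
PKG=${PKG:-sudo dnf}

install_jdk17_rhel() {
  # $PKG is left unquoted on purpose so "sudo dnf" splits into two words
  $PKG install -y java-17-openjdk-devel
}

# Hypothetical helper: succeed only if a directory looks like a full JDK
# (it must contain an executable bin/javac, which a JRE-only install lacks).
is_jdk_home() {
  [ -x "$1/bin/javac" ]
}

# Example usage (uncomment to run; the path is an assumption, check /usr/lib/jvm):
# install_jdk17_rhel
# candidate=/usr/lib/jvm/java-17-openjdk
# is_jdk_home "$candidate" && export JAVA_HOME="$candidate"
```

On older releases that ship yum instead of dnf, set PKG="sudo yum" before calling the function.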
Windows
Installing Maven
If you don’t have Maven installed, follow the instructions for your operating system:
macOS
Using Homebrew, run brew install maven.
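Linux
Install the maven package with your distribution’s package manager: apt on Ubuntu/Debian, or dnf (yum on older releases) on RHEL/CentOS.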
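The Linux install commands can be sketched as a single function that takes the package manager name. install_maven is a hypothetical helper, the package is assumed to be named maven on all three package managers, and nothing runs until you call it.

```shell
#!/bin/sh
# Sketch for Linux. Pass the package manager you use; nothing runs until
# you call install_maven. SUDO can be set to echo for a dry run.
SUDO=${SUDO:-sudo}

install_maven() {
  case "$1" in
    apt) $SUDO apt-get install -y maven ;;   # Ubuntu/Debian
    dnf) $SUDO dnf install -y maven ;;       # RHEL/CentOS 8 and later
    yum) $SUDO yum install -y maven ;;       # older RHEL/CentOS
    *)   echo "usage: install_maven apt|dnf|yum" >&2; return 2 ;;
  esac
}

# Example usage (uncomment to run):
# install_maven apt
# mvn -version
```

Distribution packages can lag behind upstream, so it is worth confirming that mvn -version reports 3.6 or later after installing.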
Windows
- Download Maven from the Apache Maven website
- Extract the archive to a directory (e.g., C:\Program Files\Apache\maven)
- Add Maven’s bin directory to your PATH environment variable
Installing Lucille
Now that you have the prerequisites, let’s install Lucille.
Build Lucille
Build Lucille and all its modules by running mvn clean install from the top-level lucille/ directory. This command will:
- Compile all Java source code
- Run unit tests
- Package JARs for all modules
- Install artifacts to your local Maven repository (~/.m2/repository)
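A guarded version of the build step is sketched below. It assumes the top-level directory contains a pom.xml and a lucille-core module directory; is_lucille_root and build_lucille are hypothetical helper names, and MVN can be overridden for a dry run.

```shell
#!/bin/sh
MVN=${MVN:-mvn}

# Hypothetical helper: succeed if DIR looks like the top-level project
# directory (a pom.xml plus the lucille-core module directory).
is_lucille_root() {
  [ -f "$1/pom.xml" ] && [ -d "$1/lucille-core" ]
}

# Build from the given directory, refusing to run from the wrong place.
build_lucille() {
  if ! is_lucille_root "$1"; then
    echo "not the top-level lucille/ directory: $1" >&2
    return 1
  fi
  # Run in a subshell so the caller's working directory is unchanged
  (cd "$1" && "$MVN" clean install)
}

# Example usage (uncomment to run; the path is a placeholder):
# build_lucille "$HOME/lucille"
```

Checking the directory first avoids building a single module by accident, which would leave the other modules missing from the local repository.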
The first build may take 5-10 minutes as Maven downloads all dependencies. Subsequent builds will be much faster.
Project Structure
After installation, the top-level lucille/ directory contains the Maven modules that make up the project, including lucille-core.
Verify Your Installation
Let’s verify that Lucille is properly installed by running a simple example with the -validate flag. The -validate flag checks your configuration without running the connector, which is useful for catching errors before starting a long-running job.
Optional: Install Search Engines
Depending on your use case, you may want to install one or more search engines.
Apache Solr
Install Solr
- Download Solr from Apache Solr Downloads
- Extract the archive
- Start Solr from the extracted directory (bin/solr start)
- Create a collection (bin/solr create -c <collection-name>)
- Verify: Open http://localhost:8983/solr in your browser
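The Solr steps above can be sketched from the command line as follows. SOLR_DIR, the collection name quickstart, and both function names are assumptions for illustration; the ping URL relies on Solr's built-in /admin/ping handler.

```shell
#!/bin/sh
# SOLR_DIR is the directory created by extracting the archive
# (hypothetical variable name; set it to your own path).
SOLR_DIR=${SOLR_DIR:-./solr}

# Start Solr, then create the named collection. Runs only when called.
start_solr_with_collection() {
  "$SOLR_DIR/bin/solr" start &&
    "$SOLR_DIR/bin/solr" create -c "$1"
}

# Hypothetical helper: build the URL of a collection's ping endpoint.
# Arguments: collection [host] [port]
solr_ping_url() {
  printf 'http://%s:%s/solr/%s/admin/ping\n' "${2:-localhost}" "${3:-8983}" "$1"
}

# Example usage (uncomment to run; "quickstart" is an arbitrary name):
# start_solr_with_collection quickstart
# curl -fsS "$(solr_ping_url quickstart)"
```

The curl check is a scriptable alternative to opening the admin UI in a browser.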
Elasticsearch
Install Elasticsearch
- Download Elasticsearch from Elastic Downloads
- Extract the archive and start Elasticsearch (bin/elasticsearch)
- Verify: curl http://localhost:9200
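To turn the verify step into a scriptable check, the version can be pulled out of the JSON that the root endpoint returns. es_version is a hypothetical helper that assumes the response contains a "number" field under "version", as Elasticsearch responses normally do.

```shell
#!/bin/sh
# Hypothetical helper: read the root endpoint's JSON on stdin and print
# the value of version.number.
es_version() {
  sed -n 's/.*"number"[[:space:]]*:[[:space:]]*"\([^"]*\)".*/\1/p' | head -n 1
}

# Example usage (uncomment to run against a local node):
# curl -fsS http://localhost:9200 | es_version
```

Recent Elasticsearch releases enable TLS and authentication by default, in which case a plain HTTP request is rejected; that points to the node's security settings rather than a failed install.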
OpenSearch
Install OpenSearch
Configuration Tips
Setting Memory Limits
For large datasets, you may need to increase the Java heap size using these JVM options:
- -Xmx4g: Maximum heap size of 4 GB
- -Xms2g: Initial heap size of 2 GB
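One way to apply these options is sketched below. heap_flags is a hypothetical helper that builds the two flags and rejects an initial size above the maximum, which the JVM would refuse at startup; passing them through JAVA_TOOL_OPTIONS is an assumption about how you launch Lucille, and you can equally place the flags directly on the java command line.

```shell
#!/bin/sh
# Hypothetical helper: build JVM heap flags from sizes in gigabytes,
# rejecting an initial size larger than the maximum.
# Arguments: initial_gb max_gb
heap_flags() {
  hf_init=$1
  hf_max=$2
  if [ "$hf_init" -gt "$hf_max" ]; then
    echo "initial heap (${hf_init}g) exceeds maximum (${hf_max}g)" >&2
    return 1
  fi
  printf '%s\n' "-Xms${hf_init}g -Xmx${hf_max}g"
}

# Example usage (uncomment to run): JAVA_TOOL_OPTIONS is read by the JVM at
# startup, so it applies to any java command launched from this shell.
# export JAVA_TOOL_OPTIONS="$(heap_flags 2 4)"
```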
Logging Configuration
Lucille uses SLF4J with Logback for logging. You can customize logging by creating a logback.xml file.
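A minimal logback.xml might look like the one written by the sketch below. The appender name, pattern, and INFO level are illustrative choices rather than Lucille's shipped defaults, and LOGBACK_FILE is a hypothetical variable; note that the script overwrites any existing file at that path.

```shell
#!/bin/sh
# Where to write the file (hypothetical variable; defaults to ./logback.xml)
LOGBACK_FILE=${LOGBACK_FILE:-./logback.xml}

cat > "$LOGBACK_FILE" <<'EOF'
<configuration>
  <!-- Write log lines to the console -->
  <appender name="STDOUT" class="ch.qos.logback.core.ConsoleAppender">
    <encoder>
      <pattern>%d{HH:mm:ss.SSS} %-5level %logger{36} - %msg%n</pattern>
    </encoder>
  </appender>

  <!-- Default level for all loggers -->
  <root level="INFO">
    <appender-ref ref="STDOUT" />
  </root>
</configuration>
EOF
echo "wrote $LOGBACK_FILE"
```

Logback picks up a specific file when the JVM is started with -Dlogback.configurationFile=/path/to/logback.xml, or when a logback.xml is found on the classpath.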
Troubleshooting
Maven build fails with 'diamond operator not supported'
This usually means Maven is compiling with an outdated Java version. Lucille requires Java 17+. Check your version with java -version; if you have multiple Java versions installed, set JAVA_HOME to point to Java 17.
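Maven chooses its Java runtime from JAVA_HOME, so it can differ from the java on your PATH. The sketch below reads the "Java version:" line that mvn -version prints; mvn_java_major is a hypothetical helper name, and the JDK path in the usage comment is a placeholder.

```shell
#!/bin/sh
# Hypothetical helper: read `mvn -version` output on stdin and print the
# major version of the Java runtime Maven is using.
mvn_java_major() {
  # The relevant line looks like: Java version: 17.0.9, vendor: ...
  mj_ver=$(sed -n 's/^Java version: \([^,]*\),.*/\1/p' | head -n 1)
  # Releases before Java 9 use the 1.x scheme (1.8.0_392 means Java 8)
  case "$mj_ver" in
    1.*) mj_ver=${mj_ver#1.} ;;
  esac
  printf '%s\n' "${mj_ver%%[._-]*}"
}

# Example usage (uncomment to run; replace the placeholder path):
# mvn -version | mvn_java_major
# export JAVA_HOME=/path/to/jdk-17
# mvn -version | mvn_java_major
```

If the first call prints a number below 17 and the second prints 17, the build was picking up the wrong JDK.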
Maven build fails with 'peer not authenticated' or SSL errors
This may be a corporate proxy or firewall issue. Try configuring Maven to use your corporate proxy in ~/.m2/settings.xml.
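A proxy entry for Maven's settings file could look like the one written by this sketch. The host, port, and id are placeholders to replace with your own values, and SETTINGS_FILE is a hypothetical variable that defaults to a scratch file so an existing ~/.m2/settings.xml is not overwritten.

```shell
#!/bin/sh
# Scratch location for the example (hypothetical variable name)
SETTINGS_FILE=${SETTINGS_FILE:-./settings-proxy.xml}

cat > "$SETTINGS_FILE" <<'EOF'
<settings>
  <proxies>
    <proxy>
      <id>corporate-proxy</id>
      <active>true</active>
      <protocol>http</protocol>
      <!-- Placeholder host and port: replace with your proxy's values -->
      <host>proxy.example.com</host>
      <port>8080</port>
      <nonProxyHosts>localhost|127.0.0.1</nonProxyHosts>
    </proxy>
  </proxies>
</settings>
EOF
echo "wrote $SETTINGS_FILE"
```

You can try the file without touching your main settings by running mvn -s ./settings-proxy.xml clean install, then merge the proxies element into ~/.m2/settings.xml once it works.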
Tests fail during build
Some tests require external services. You can skip tests during installation by passing -DskipTests to Maven.
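Skipping tests during the build can be sketched as a small wrapper. install_skip_tests is a hypothetical name, and MVN can be overridden (for example MVN=echo) to print the arguments instead of running Maven.

```shell
#!/bin/sh
MVN=${MVN:-mvn}

# -DskipTests still compiles the test sources but does not run them.
# Extra arguments are passed through to Maven unchanged.
install_skip_tests() {
  "$MVN" clean install -DskipTests "$@"
}

# Example usage (uncomment to run from the top-level lucille/ directory):
# install_skip_tests
```

Because the tests are still compiled, a compilation error in test code will continue to fail the build.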
OutOfMemoryError during build
Increase Maven’s memory by setting the MAVEN_OPTS environment variable.
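Raising Maven's heap limit might look like the sketch below. The 2g value is an illustrative assumption to adjust for your machine, and any options already present in MAVEN_OPTS are preserved.

```shell
#!/bin/sh
# Append a heap limit to MAVEN_OPTS, keeping any options already set.
# The 2g value is an illustrative assumption; adjust it to your machine.
MAVEN_OPTS="${MAVEN_OPTS:+$MAVEN_OPTS }-Xmx2g"
export MAVEN_OPTS
echo "MAVEN_OPTS=$MAVEN_OPTS"

# Then rebuild from the top-level lucille/ directory:
# mvn clean install
```

MAVEN_OPTS applies to the JVM that runs Maven itself. If the error comes from a forked test JVM, this setting may not reach it.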
Cannot find 'lucille-core' dependency
Make sure you ran mvn clean install from the top-level lucille/ directory. This installs all modules to your local Maven repository.
Next Steps
Now that you have Lucille installed, you’re ready to start building ETL pipelines:
Quickstart
Follow the quickstart guide to run your first Lucille example
Connectors
Learn about the available data source connectors
Pipelines
Discover the transformation stages you can use in pipelines
Indexers
Configure indexers for different search engines and databases