Overview
SequenceConnector generates a configurable number of empty Documents with sequential numeric IDs. This connector is primarily used for testing pipelines, load testing, and development scenarios where you need a controlled number of documents without external data sources.
Location: com.kmwllc.lucille.connector.SequenceConnector
Use Cases
- Pipeline testing: Verify stage transformations with known document counts
- Load testing: Generate large volumes of documents to test throughput
- Development: Test pipeline configurations without external dependencies
- Benchmarking: Measure stage performance with controlled inputs
Configuration Parameters
Total number of Documents to create.
First ID value to use. Document IDs will be sequential starting from this value.
The name of the connector instance.
Must be
com.kmwllc.lucille.connector.SequenceConnector.The name of the pipeline to send documents to.
Prefix to add to all document IDs.
Examples
Basic Usage
Generate 1000 empty documents with IDs 0-999:Custom Start Value
Generate documents with IDs starting from 5000:With Document ID Prefix
Add a namespace prefix to all document IDs:Load Testing Pipeline
Test a pipeline with 1 million documents:Document Structure
Documents created by SequenceConnector contain only:- Document ID (sequential number with optional prefix)
- Standard metadata fields (run_id, connector_name)
Performance
SequenceConnector is extremely fast since it doesn’t read from external sources:- Can generate millions of documents per second
- Zero I/O overhead
- Ideal baseline for benchmarking stage performance
Common Patterns
Testing Stage Transformations
Combine with random data stages to test transformations:Comparing Pipeline Performance
Test multiple pipelines with identical document loads:Next Steps
Document Generation Guide
Generate realistic test data with random stages
FileConnector
Process real files from local or cloud storage