Overview
The CSV Connector provides functionality to retrieve data from CSV files and register it in the Fess index.
This feature requires the fess-ds-csv plugin.
Prerequisites
Plugin installation is required
Access to the CSV file is required
You must know the character encoding of the CSV file
Plugin Installation
Method 1: Place JAR file directly
Method 2: Install from admin console
Open “System” -> “Plugins”
Upload the JAR file
Restart Fess
Configuration
Configure from admin console via “Crawler” -> “Data Store” -> “Create New”.
Basic Settings
| Item | Example |
|---|---|
| Name | Products CSV |
| Handler Name | CsvDataStore |
| Enabled | On |
Parameter Settings
Local file:
HTTP file:
Multiple files:
Parameter List
Script Settings
With header row:
Without header row (column index):
Available Fields
data.<column_name>- Header row column name (when has_header_line=true)data.cell<N>- Column index (when has_header_line=false, 1-based:cell1,cell2…)
CSV Format Details
Standard CSV (RFC 4180 compliant)
Changing Separator
Tab-separated (TSV):
Semicolon-separated:
Custom Quote Character
Single quote:
Encoding
Japanese file (Shift_JIS):
Japanese file (EUC-JP):
Usage Examples
Product Catalog CSV
CSV file (products.csv):
Parameters:
Script:
Filtering by stock status:
Employee Directory CSV
CSV file (employees.csv):
Parameters:
Script:
CSV Without Header
CSV file (data.csv):
Parameters:
Script:
Multiple CSV Files Integration
Parameters:
Script:
Fetch CSV from HTTP
Parameters:
Script:
Tab-Separated (TSV) File
TSV file (data.tsv):
Parameters:
Script:
Troubleshooting
File Not Found
Symptom: FileNotFoundException or No such file
Check:
Verify the file path is correct (absolute path recommended)
Verify the file exists
Verify read permissions on the file
Verify Fess user can access the file
Character Encoding Issues
Symptom: Japanese or other characters not displayed correctly
Solution:
Specify the correct character encoding:
Check file encoding:
Columns Not Recognized Correctly
Symptom: Column separation not recognized correctly
Check:
Verify separator is correct:
Verify quote character setting
Verify CSV file format (RFC 4180 compliant)
Header Row Handling
Symptom: First row is recognized as data
Solution:
When header row exists:
When header row does not exist:
No Data Retrieved
Symptom: Crawl succeeds but count is 0
Check:
Verify CSV file is not empty
Verify script settings are correct
Verify column names are correct (when has_header_line=true)
Check logs for error messages
Large CSV Files
Symptom: Out of memory or timeout
Solution:
Split CSV file into multiple files
Use only necessary columns in script
Increase Fess heap size
Filter unnecessary rows
Fields with Line Breaks
RFC 4180 format allows handling fields with line breaks by enclosing in quotes:
Parameters:
Advanced Script Examples
Data Processing
Conditional Indexing
Combining Multiple Columns
Date Formatting
Reference
Data Store Connector Overview - DataStore Connector Overview
JSON Connector - JSON Connector
Database Connector - Database Connector
Data Store Crawling - Data Store Configuration Guide