Getting Started
Overview
kETL is a privacy-first platform that helps scientists and researchers create custom data processing tools without coding. Your data never leaves your premises, and all processing happens locally.
Key Benefits: Complete data privacy • No coding required • Transparent processing • Regulatory compliance • Custom to your needs
Quick Start
- 1Describe Your Data
Tell kETL about your data structure using plain language or upload a sample file
- 2Configure Processing
Choose transformations and output formats for your data
- 3Generate Tool
kETL creates a custom executable tailored to your requirements
- 4Process at Scale
Run your tool on thousands of files locally with full control
Creating Your First Tool
Let's walk through creating your first ETL tool with kETL:
Step 1: Start Discovery
Click "Start Discovery" and describe your data. For example:
"I have CSV files with columns: Sample_ID, Concentration_mg/mL, Temperature_C, Time_hr, and OD600"Step 2: Review Data Shape
kETL will automatically detect column types and units. Review and adjust as needed using the data shape builder.
Step 3: Select Modules
Choose processing modules like data validation, unit conversion, statistical analysis, or visualization based on your needs.
Data Discovery
Discovery Flow
kETL uses an intelligent discovery system to understand your data:
- Natural language description of your data structure
- Automatic detection of column types and units
- Pattern recognition for scientific data formats
- Interactive refinement through guided questions
Column Types
kETL recognizes various scientific data types:
Numeric Types
- • Measurements with units
- • Concentrations
- • Temperatures
- • Time points
- • Percentages
Categorical Types
- • Sample identifiers
- • Treatment groups
- • Batch numbers
- • Experimental conditions
- • Quality flags
Data Shapes
kETL supports various scientific data formats:
Tabular Data
Standard row-column format with headers (CSV, Excel, TSV)
Plate Reader Data
96/384-well plate formats with layout information
Time Series Data
Measurements over time with regular or irregular intervals
Tool Generation
Available Modules
kETL provides pre-built modules for common scientific data operations:
Data Validation
Check data integrity, detect outliers, validate ranges
Data Transformation
Unit conversion, normalization, aggregation
Statistical Analysis
Basic statistics, curve fitting, correlation analysis
Visualization
Charts, heatmaps, dose-response curves
Customization Options
Tailor your tool to your specific needs:
- Configure input file patterns and formats
- Set validation rules and thresholds
- Choose output formats and naming conventions
- Add custom calculations and transformations
Export Formats
Generated tools support multiple output formats:
CSV/Excel
Tabular data
JSON
Structured data
PDF Reports
With charts
Privacy & Security
Data Privacy
Your Data Never Leaves Your Premises
kETL only needs to understand your data structure, not see your actual data. All processing happens locally on your machines.
- No data upload required
- No cloud processing
- No external API calls with your data
On-Premise Execution
Generated tools are standalone executables that run entirely on your infrastructure:
Standalone Executables
No internet connection required after download
Source Code Access
Premium users can review and modify the generated code
Audit Trail
Complete logging of all operations for compliance
Compliance
kETL is designed with regulatory compliance in mind:
FDA 21 CFR Part 11
- • Electronic signatures
- • Audit trails
- • Data integrity
GDPR Compliant
- • Data minimization
- • Privacy by design
- • User control
Frequently Asked Questions
General Questions
What file formats does kETL support?
kETL supports CSV, Excel (xlsx/xls), TSV, and other delimited text formats. We're continuously adding support for more scientific data formats.
How many tools can I generate?
Your first tool is free. After that, tools are £5 each. There's no limit to how many tools you can create.
Can I modify the generated tools?
Premium users have access to the source code and can modify it as needed. Standard users receive compiled executables.
Technical Questions
What programming language are tools generated in?
Tools are generated in Python and packaged as standalone executables using PyInstaller. No Python installation is required to run them.
What operating systems are supported?
Generated tools run on Windows, macOS, and Linux. We provide platform-specific executables for each operating system.
How large can my data files be?
Generated tools can handle files up to several GB in size. For very large datasets, tools process data in chunks to manage memory efficiently.
Troubleshooting
Need Help?
If you encounter any issues, please contact support at support@ketl.asae.bio