Documentation

Everything you need to know about kETL

Getting Started

Overview

kETL is a privacy-first platform that helps scientists and researchers create custom data processing tools without coding. Your data never leaves your premises, and all processing happens locally.

Key Benefits: Complete data privacy • No coding required • Transparent processing • Regulatory compliance • Custom to your needs

Quick Start

  1. Describe Your Data

     Tell kETL about your data structure using plain language or upload a sample file

  2. Configure Processing

     Choose transformations and output formats for your data

  3. Generate Tool

     kETL creates a custom executable tailored to your requirements

  4. Process at Scale

     Run your tool on thousands of files locally with full control

Creating Your First Tool

Let's walk through creating your first ETL tool with kETL:

Step 1: Start Discovery

Click "Start Discovery" and describe your data. For example:

"I have CSV files with columns: Sample_ID, Concentration_mg/mL, Temperature_C, Time_hr, and OD600"

Step 2: Review Data Shape

kETL will automatically detect column types and units. Review and adjust as needed using the data shape builder.
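
To illustrate the kind of detection described in this step, here is a minimal Python sketch that splits a header such as Concentration_mg/mL into a name and a unit. The KNOWN_UNITS table and the split_header function are hypothetical names chosen for this example. They are not kETL's actual detection logic, which this page does not describe.

```python
from typing import Optional, Tuple

# Hypothetical unit suffixes for illustration only (assumption, not kETL's rules).
KNOWN_UNITS = {"mg/mL": "concentration", "C": "temperature", "hr": "time"}


def split_header(header: str) -> Tuple[str, Optional[str]]:
    """Split 'Name_unit' into (name, unit); unit is None if no known suffix matches."""
    name, sep, suffix = header.rpartition("_")
    if sep and suffix in KNOWN_UNITS:
        return name, suffix
    return header, None


headers = ["Sample_ID", "Concentration_mg/mL", "Temperature_C", "Time_hr", "OD600"]
shape = {h: split_header(h) for h in headers}
```

Columns without a recognized suffix, such as Sample_ID and OD600, come back with no unit. These are the columns you would most likely adjust by hand in the data shape builder.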

Step 3: Select Modules

Choose processing modules like data validation, unit conversion, statistical analysis, or visualization based on your needs.

Data Discovery

Discovery Flow

kETL uses an intelligent discovery system to understand your data:

  • Natural language description of your data structure
  • Automatic detection of column types and units
  • Pattern recognition for scientific data formats
  • Interactive refinement through guided questions

Column Types

kETL recognizes various scientific data types:

Numeric Types

  • Measurements with units
  • Concentrations
  • Temperatures
  • Time points
  • Percentages

Categorical Types

  • Sample identifiers
  • Treatment groups
  • Batch numbers
  • Experimental conditions
  • Quality flags

Data Shapes

kETL supports various scientific data formats:

Tabular Data

Standard row-column format with headers (CSV, Excel, TSV)

Plate Reader Data

96/384-well plate formats with layout information
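
As a rough illustration of what plate layout information involves, the sketch below converts a well label such as B7 into zero-based row and column indices. It assumes the standard layouts of 8 x 12 wells for 96-well plates and 16 x 24 wells for 384-well plates. The well_to_index function is a hypothetical helper for this example, not part of kETL.

```python
import string

# Standard plate dimensions as (rows, columns).
PLATE_DIMS = {96: (8, 12), 384: (16, 24)}


def well_to_index(well: str, plate_size: int = 96):
    """Convert a label like 'B7' to zero-based (row, column) indices."""
    rows, cols = PLATE_DIMS[plate_size]
    row = string.ascii_uppercase.index(well[0].upper())  # 'A' -> 0, 'B' -> 1, ...
    col = int(well[1:]) - 1                              # '7' -> 6
    if row >= rows or not 0 <= col < cols:
        raise ValueError(f"{well!r} is not a valid well on a {plate_size}-well plate")
    return row, col
```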

Time Series Data

Measurements over time with regular or irregular intervals

Tool Generation

Available Modules

kETL provides pre-built modules for common scientific data operations:

Data Validation

Check data integrity, detect outliers, validate ranges
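
A minimal sketch of two such checks, written in plain Python for illustration. The function names, the 3.5 threshold, and the choice of a median-based outlier test are assumptions made for this example. The validation module's actual methods are not documented here.

```python
from statistics import median


def out_of_range(values, low, high):
    """Return the positions of values that fall outside [low, high]."""
    return [i for i, v in enumerate(values) if not (low <= v <= high)]


def mad_outliers(values, threshold=3.5):
    """Return positions of outliers using the modified z-score (median and MAD)."""
    med = median(values)
    mad = median(abs(v - med) for v in values)  # median absolute deviation
    if mad == 0:
        return []  # more than half the values equal the median; the score is undefined
    return [i for i, v in enumerate(values)
            if 0.6745 * abs(v - med) / mad > threshold]


od600 = [0.51, 0.49, 0.50, 0.52, 0.48, 2.90]
print(mad_outliers(od600))             # flags the last reading
print(out_of_range(od600, 0.0, 2.0))   # the same reading also fails the range check
```

A median-based test is used in this sketch because a single extreme reading inflates the mean and standard deviation, which can hide the outlier in small samples.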

Data Transformation

Unit conversion, normalization, aggregation
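
For example, unit conversion can be sketched as a lookup table of conversion functions. The CONVERTERS table, the unit spellings, and the convert function are hypothetical and only cover the units from the sample description earlier on this page.

```python
# Hypothetical conversion table keyed by (from_unit, to_unit).
CONVERTERS = {
    ("C", "K"): lambda c: c + 273.15,        # Celsius to kelvin
    ("hr", "min"): lambda h: h * 60,         # hours to minutes
    ("mg/mL", "ug/mL"): lambda x: x * 1000,  # milligrams to micrograms per mL
}


def convert(value, from_unit, to_unit):
    """Convert value between units, failing loudly on unknown pairs."""
    if from_unit == to_unit:
        return value
    try:
        return CONVERTERS[(from_unit, to_unit)](value)
    except KeyError:
        raise ValueError(f"no conversion from {from_unit} to {to_unit}") from None
```

Raising an error for an unknown pair, rather than returning the value unchanged, keeps a mislabeled unit from passing through silently.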

Statistical Analysis

Basic statistics, curve fitting, correlation analysis
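
As a sketch of what basic statistics and correlation analysis cover, the example below computes a summary and a Pearson correlation coefficient using only the Python standard library. The summarize and pearson_r functions are illustrative names, not kETL's module interface.

```python
from math import sqrt
from statistics import mean, stdev


def summarize(values):
    """Return the count, mean, and sample standard deviation of a column."""
    return {"n": len(values), "mean": mean(values), "stdev": stdev(values)}


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length columns."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two columns of equal length with at least 2 values")
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    if sx == 0 or sy == 0:
        raise ValueError("correlation is undefined for a constant column")
    return cov / (sx * sy)
```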

Visualization

Charts, heatmaps, dose-response curves

Customization Options

Tailor your tool to your specific needs:

  • Configure input file patterns and formats
  • Set validation rules and thresholds
  • Choose output formats and naming conventions
  • Add custom calculations and transformations
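
The options above could be pictured as a small configuration object. This is a sketch only: the ToolConfig class and its field names are assumptions for illustration and do not reflect kETL's real configuration format.

```python
from dataclasses import dataclass, field
from fnmatch import fnmatch
from pathlib import PurePath


@dataclass
class ToolConfig:
    """Hypothetical settings for a generated tool."""
    input_pattern: str = "*.csv"                 # which input files to pick up
    ranges: dict = field(default_factory=dict)   # column -> (low, high) thresholds
    output_suffix: str = "_processed"            # appended to the input file stem
    output_format: str = "csv"

    def accepts(self, filename: str) -> bool:
        """True if the file name (without directories) matches the input pattern."""
        return fnmatch(PurePath(filename).name, self.input_pattern)

    def output_name(self, filename: str) -> str:
        """Build the output file name from the input file name."""
        return f"{PurePath(filename).stem}{self.output_suffix}.{self.output_format}"


cfg = ToolConfig(input_pattern="plate_*.csv", ranges={"OD600": (0.0, 2.0)})
```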

Export Formats

Generated tools support multiple output formats:

CSV/Excel

Tabular data

JSON

Structured data

PDF Reports

With charts
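
To make the tabular and structured formats concrete, here is a minimal sketch that serializes the same rows as CSV and as JSON with the Python standard library. The to_csv and to_json helpers are hypothetical, and PDF reports are left out because they need a third-party library.

```python
import csv
import io
import json


def to_csv(rows):
    """Serialize a list of dicts as CSV text with a header row."""
    if not rows:
        return ""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(rows[0]), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


def to_json(rows):
    """Serialize the same rows as indented JSON text."""
    return json.dumps(rows, indent=2)
```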

Privacy & Security

Data Privacy

Your Data Never Leaves Your Premises

kETL only needs to understand your data structure, not see your actual data. All processing happens locally on your machines.

  • No data upload required
  • No cloud processing
  • No external API calls with your data

On-Premise Execution

Generated tools are standalone executables that run entirely on your infrastructure:

Standalone Executables

No internet connection required after download

Source Code Access

Premium users can review and modify the generated code

Audit Trail

Complete logging of all operations for compliance
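
As an illustration of operation logging, the sketch below builds one audit record per operation and appends it to a log file as a line of JSON. The record fields, function names, and log format are assumptions for this example. They do not describe kETL's actual audit log, and the sketch alone does not establish regulatory compliance.

```python
import hashlib
import json
from datetime import datetime, timezone


def audit_record(operation: str, filename: str, content: bytes) -> dict:
    """Describe one operation, including a SHA-256 hash of the file content."""
    return {
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "operation": operation,
        "file": filename,
        "sha256": hashlib.sha256(content).hexdigest(),
    }


def append_audit(log_path: str, record: dict) -> None:
    """Append the record to the log as a single JSON line."""
    with open(log_path, "a", encoding="utf-8") as log:
        log.write(json.dumps(record, sort_keys=True) + "\n")
```

Recording a hash of the input makes it possible to confirm later that a logged operation ran against the file you expect.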

Compliance

kETL is designed with regulatory compliance in mind:

FDA 21 CFR Part 11

  • Electronic signatures
  • Audit trails
  • Data integrity

GDPR Compliant

  • Data minimization
  • Privacy by design
  • User control

Frequently Asked Questions

General Questions

What file formats does kETL support?

kETL supports CSV, Excel (xlsx/xls), TSV, and other delimited text formats. We're continuously adding support for more scientific data formats.

How many tools can I generate?

Your first tool is free. After that, tools are £5 each. There's no limit to how many tools you can create.

Can I modify the generated tools?

Premium users have access to the source code and can modify it as needed. Standard users receive compiled executables.

Technical Questions

What programming language are tools generated in?

Tools are generated in Python and packaged as standalone executables using PyInstaller. No Python installation is required to run them.

What operating systems are supported?

Generated tools run on Windows, macOS, and Linux. We provide platform-specific executables for each operating system.

How large can my data files be?

Generated tools can handle files up to several GB in size. For very large datasets, tools process data in chunks to manage memory efficiently.
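
Chunked processing can be sketched as follows: rows are read a fixed number at a time, so memory use depends on the chunk size rather than the file size. The iter_chunks and column_mean functions and the default chunk size are hypothetical, and the code generated by kETL may work differently.

```python
import csv
import io
from itertools import islice


def iter_chunks(rows, size):
    """Yield lists of at most `size` rows from any row iterator."""
    rows = iter(rows)
    while chunk := list(islice(rows, size)):
        yield chunk


def column_mean(handle, column, chunk_size=1000):
    """Compute the mean of one numeric CSV column without loading the whole file."""
    reader = csv.DictReader(handle)
    total, count = 0.0, 0
    for chunk in iter_chunks(reader, chunk_size):
        total += sum(float(row[column]) for row in chunk)
        count += len(chunk)
    return total / count if count else None


# Usage with an in-memory sample; a real run would pass an open file instead.
sample = io.StringIO("Sample_ID,OD600\nS1,0.5\nS2,1.0\nS3,1.5\n")
print(column_mean(sample, "OD600", chunk_size=2))
```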

Troubleshooting

Need Help?

If you encounter any issues, please contact support at support@ketl.asae.bio