Overview
This Alli Workflows script removes any rows from a .csv file that contain null (missing) values in specified columns. It helps ensure cleaner datasets for downstream automation steps such as enrichment, transformation, or upload.
Variables
|
Name |
Reference |
Type |
Required |
Default |
Options |
Description |
|---|---|---|---|---|---|---|
|
Source Folder Name |
|
Alphanumeric |
|
|
|
Path to the folder containing your CSV file. Defaults to the current directory ( |
|
Source Filename |
|
Alphanumeric |
✅ |
|
|
Name of the CSV file to clean. Must include the Note: Output will overwrite this file with the filtered version. |
|
Column Name(s) |
|
Alphanumeric |
✅ |
|
|
Comma-separated list of column names to check for nulls (e.g. |
Example Values in Alli Workflows:
|
Variable |
Example Value |
|---|---|
|
|
|
|
|
|
|
|
|
What the Script Does
-
Reads Input File
UsesFOLDER_NAMEandFILE_NAMEto locate the.csvfile. Defaults to the working directory ifFOLDER_NAMEis not set. -
Validates Configuration
-
Ensures
FILE_NAMEis provided and ends in.csv. -
Verifies that
COLUMN_NAMEis populated.
-
-
Parses Columns & Drops Null Rows
-
Converts
COLUMN_NAMEinto a list. -
Filters out any rows that contain
nullvalues in any of the specified columns.
-
-
Overwrites the Original File
The filtered data is saved back to the same path for use in the next step. -
Logs Result Summary
The script prints:pgsqlCopyEdit
Filtered X row(s) with nulls in [columns]. Saved cleaned CSV back to /path/to/file.csv
Requirements
-
A valid
.csvfile must be present, either uploaded in a prior step or passed from an upstream task. -
Column names in
COLUMN_NAMEmust match exactly (case-sensitive) with the CSV headers.
Typical Use Cases
-
Removing incomplete rows from product feeds before AI enrichment or scoring.
-
Ensuring clean survey or form response data before analytics.
-
Validating required fields prior to exporting to external systems.
Alli Workflows Tips
-
Use an Upload File step before this script to provide the
.csvfile if it’s not generated in a previous task. -
Chain this task before model inference, data export, or QA automation steps.
-
Review logs for details on filtered rows and affected columns.