Split huge CSV by columns with Miller

I need to split huge (>1 GB) CSV files, each containing 50K+ columns, on a daily basis.

Miller looks like an interesting and performant tool for this task.

But I'm stuck on Miller's documentation.

How can I split one CSV into N smaller CSV files, where N is the number of rows in my source file?

Solution

Try this script:

mlr --csv put -S 'if (NR % 10000 == 0) {$rule=NR} else {$rule = ""}' \
then fill-down -f rule \
then put -S 'if ($rule=="") {$rule="0"}' \
then put -q 'tee > $rule.".csv", $*' input.csv

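Make a copy of your CSV in a new, empty folder and run this script from there, because the output files are written to the current directory and named after the value of the rule field (for example 0.csv). It will produce one CSV file for every 10,000 rows.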
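To see which output file a given row should end up in, the naming arithmetic can be sketched in plain shell. This is only an illustration, not part of Miller: chunk_label is a hypothetical helper, and it assumes that the rule value set on every 10000th row is carried forward to the rows that follow it, with rows before the first multiple of 10000 falling back to 0.

```shell
#!/bin/sh
# Hypothetical helper (not a Miller command): print the output file name
# that a given data row number would map to.
# Assumption: the rule value set on every 10000th row is carried forward
# to the following rows, and earlier rows fall back to "0".
chunk_label() (
  nr=$1              # 1-based data row number (Miller's NR)
  size=${2:-10000}   # rows per chunk; defaults to the 10000 used in the script
  # Integer division rounds down to the last multiple of the chunk size.
  echo "$(( nr / size * size )).csv"
)

chunk_label 1        # 0.csv
chunk_label 9999     # 0.csv
chunk_label 10000    # 10000.csv
chunk_label 25000    # 20000.csv
```

Under that assumption, 0.csv would hold rows 1 to 9999, and each later file would start at a multiple of 10000. Passing a second argument, for example chunk_label 2500 1000, shows the mapping for a different chunk size.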
