How do I split a custom dataset into training and test datasets in Flux.jl?

I have a custom dataset and I would like to split that dataset into a "training" and "test" set (also potentially a "validation" set if possible). How would I achieve this using Flux.jl or other Julia machine learning packages?

Solution

You can import the TrainTestSplit function from the Lathe package, as in:

using Lathe.preprocess: TrainTestSplit

and then implement it in your code like this for example:

dataset_id = TrainTestSplit(datasetmap[:], 0.8); #datasetmap is your label encoded matrix

Am assuming you're using Pluto notebook but, it should work in any other environment as well i,e jupyter, atom, etc.

Convert binary to decimal in Julia
Multidimensional matrix permutation Julia vs. Python disagreement
numpy.einsum for Julia? (2)
VSCode cannot find the Julia executable that exists
Julia PLSQ - Integer Relationship Algorithm - anyone used it in anger?
StackOverflowError when constucting a struct from another in Julia
Understanding parametric types and their super-types in Julia
Unexpected memory allocation when using array views (julia)
Dictionaries vs NamedTuples
Indexing in Julia
Getting the Expression Tree of the Body of a Function in Julia
In julia, which is more performant a >= b && c <= b vs !(a<b || c>b)
Gen: How to combine multiple generative function traces in a higher-order generative function?
How to plot a vector field in Julia?
Init or main function in Julia
How to fill area between curves in 3D plot in Julia using Makie?
Post API in Julia when the input is a JSON
In Julia, how to convert a unsigned number to a signed number like in C?
How can I change the arrow size in a plot?
Increasing the performance of dot product calculation Julia Dataframe
Failed to Precompile IJulia
Creating a vector longer than 25 elements in Julia seems to fail
Is there any way to build package dependency tree in julia-lang?
Negative exponent in plot in Julia
How to select elements from array in Julia matching predicate?
What is the correct way to save and retrieve dictionaries in Julia?
Is there a way to rename the bands of a Raster in Rasters.jl?
Getting MethodError: no method matching *(::Vector{Float64}, ::Vector{Float64}) on julia FractionalDiffEq
Wrong function call when solving for steady state using julia's NonlinearSolve
Plotting Multiple Unrelated Datasets Algebra of Graphics