Deborah.DeborahCore.XYMLInfoGenerator
Deborah.DeborahCore.XYMLInfoGenerator.gen_XY_ML_info — Method
gen_XY_ML_info(
X_info::Array{T,3},
conf_arr::Vector{Int},
lb_idx::Vector{Int},
tr_idx::Vector{Int},
bc_idx::Vector{Int},
ul_idx::Vector{Int},
N_lb::Int, N_tr::Int, N_bc::Int, N_ul::Int,
name_prefix::String, overall_name::String,
analysis_dir::String,
read_column::Int;
use_avg::Bool=true, dump::Bool=true,
jobid::Union{Nothing, String}=nothing
) -> Tuple{
Array{T,2}, Array{T,2}, Array{T,2}, Array{T,2},
Vector{Int}, Vector{Int}, Vector{Int}, Vector{Int}
} where T<:Real
Split a full 2D input matrix X_info into four named subsets (LB, TR, BC, UL) and optionally write them to .dat files.
Arguments
X_info::Array{T,2}: Input matrix of shape $(N_\text{cnf}, N_\text{src})$, typically representing a single observable across configurations and sources.
conf_arr::Vector{Int}: Full list of configuration indices ($\text{length} = N_\text{cnf}$), aligned with the rows of X_info.
lb_idx::Vector{Int}: Index list for the Labeled (LB) set.
tr_idx::Vector{Int}: Index list for the Training (TR) set.
bc_idx::Vector{Int}: Index list for the Bias Correction (BC) set.
ul_idx::Vector{Int}: Index list for the Unlabeled (UL) set.
N_lb::Int, N_tr::Int, N_bc::Int, N_ul::Int: Number of configurations in each group (used for preallocation and checks).
name_prefix::String: Prefix used in output file names.
overall_name::String: Global name used for logging and file identification.
analysis_dir::String: Output directory path where files will be dumped (if dump=true).
read_column::Int: $1$-based index of the column used to extract the observable (for file naming or meta info).
use_avg::Bool=true: If true, marks jval as -1 in the output file to signal "averaged" content.
dump::Bool=true: If true, writes .dat files to analysis_dir.
jobid::Union{Nothing,String}: Optional job ID for logging or progress tracking.
Returns
Tuple of eight elements:
X_lb_info :: Array{T,3}
X_tr_info :: Array{T,3}
X_bc_info :: Array{T,3}
X_ul_info :: Array{T,3}
lb_conf_arr :: Vector{Int}
tr_conf_arr :: Vector{Int}
bc_conf_arr :: Vector{Int}
ul_conf_arr :: Vector{Int}
Each X_*_info has shape $(\texttt{column\_idx}, N_\text{set}, N_\text{src})$ and corresponds to the respective index group.
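The index-based split described above can be illustrated with a minimal sketch. It does not call gen_XY_ML_info and is not the package's implementation: split_by_index is a hypothetical helper, the input is assumed to be the 2D $(N_\text{cnf}, N_\text{src})$ matrix from the Arguments section, and the index lists and configuration numbers are made up for illustration (including the choice to draw TR and BC from the LB rows). No files are written.

```julia
# Hypothetical helper (not part of Deborah): pick the rows of a 2D matrix
# listed in `idx`, together with the matching configuration numbers.
function split_by_index(X::AbstractMatrix{T}, conf_arr::Vector{Int},
                        idx::Vector{Int}) where {T<:Real}
    return X[idx, :], conf_arr[idx]
end

# Toy input: 6 configurations (rows) x 2 sources (columns).
X_info   = Float64[10i + j for i in 1:6, j in 1:2]
conf_arr = collect(100:10:150)          # one configuration number per row

# Illustrative index lists; their relationship is an assumption of this sketch.
lb_idx = [1, 2, 3, 4]
tr_idx = [1, 2]
bc_idx = [3, 4]
ul_idx = [5, 6]

X_lb, lb_conf = split_by_index(X_info, conf_arr, lb_idx)
X_tr, tr_conf = split_by_index(X_info, conf_arr, tr_idx)
X_bc, bc_conf = split_by_index(X_info, conf_arr, bc_idx)
X_ul, ul_conf = split_by_index(X_info, conf_arr, ul_idx)

println(size(X_lb))   # rows selected for LB, all source columns kept
println(ul_conf)      # configuration numbers of the UL rows
```

Each subset keeps every source column and only the rows named by its index list, so the returned configuration vectors stay aligned row by row with the returned matrices.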