Skip to main content

Converters

All registered Converter components in DashAI.

NameDescription
AdditiveChi2SamplerApproximate the additive chi-squared kernel using Fourier sampling.
BagOfWordsConverterConvert raw text documents into a matrix of token occurrence counts.
BinarizerThreshold each feature to produce binary (0/1) values.
CharacterReplacerReplace or remove a character or substring in all selected text columns.
ColumnRemoverRemove the columns selected in scope from the dataset.
EmbeddingHuggingFace embedding converter.
FastICADecompose features into statistically independent components using FastICA.
GenericUnivariateSelectSelect features using a configurable univariate statistical test and mode.
IncrementalPCAReduce dimensionality using PCA computed incrementally over mini-batches.
KNNImputerFill missing values by averaging over the K nearest complete neighbours.
LabelBinarizerBinarize a label column into a one-vs-all binary indicator matrix.
LabelEncoderEncode categorical labels as contiguous integer codes in [0, n_classes - 1].
MaxAbsScalerScale each feature by its maximum absolute value to the range [-1, 1].
MinMaxScalerScale each feature to a fixed range, by default [0, 1].
MissingIndicatorAdd binary indicator columns that flag which values were originally missing.
NanRemoverRemove rows that contain NaN (or null-like) values in the scoped columns.
NormalizerNormalize each sample (row) independently to unit norm.
NystroemApproximate a kernel feature map using the Nystroem method.
OneHotEncoderEncode categorical columns as binary indicator (one-hot) vectors.
OrdinalEncoderEncode categorical feature columns as integer ordinal codes.
PCAReduce dimensionality using Principal Component Analysis (PCA).
PolynomialFeaturesGenerate polynomial and interaction features up to a specified degree.
RBFSamplerApproximate the RBF (Gaussian) kernel feature map via random Fourier features.
RandomUnderSamplerConverterBalance class distribution by randomly discarding majority-class samples.
SMOTEConverterBalances class distribution by generating synthetic minority-class samples.
SMOTEENNConverterHybrid sampler combining SMOTE oversampling with Edited Nearest Neighbours cleaning.
SelectFdrSelect features by controlling the expected False Discovery Rate (FDR).
SelectFprSelect features whose p-value is below a False Positive Rate threshold.
SelectFweSelect features while controlling the Family-Wise Error Rate (FWE).
SelectKBestSelect the K highest-scoring features using a univariate statistical test.
SelectPercentileSelect the top percentile of features by a univariate statistical test.
SimpleImputerFill missing values using a simple univariate per-column strategy.
SkewedChi2SamplerApproximate the skewed chi-squared kernel feature map via random Fourier features.
StandardScalerStandardize features by removing the mean and scaling to unit variance.
TFIDFConverterConvert raw text documents into a matrix of TF-IDF weighted features.
TokenizerConverterConverter that tokenizes text and stores each token ID in a separate column.
TruncatedSVDReduce dimensionality using Truncated Singular Value Decomposition (LSA).
VarianceThresholdRemove features whose variance across the training set is below a threshold.