SelectFpr

Converter

DashAI.back.converters.scikit_learn.SelectFpr

Select features whose p-value is below a False Positive Rate threshold.

SelectFpr retains every feature whose p-value, computed by a univariate scoring function against the target, is strictly less than alpha. Under the null hypothesis that a feature is independent of the target, the expected proportion of falsely retained features (false positives) is at most alpha. No multiple-testing correction is applied; each feature is tested at the raw significance level.

This selector is the most permissive of the three p-value-based filters (FPR, FDR, FWE). It is appropriate when the cost of missing a true feature outweighs the cost of including a small number of irrelevant ones, and when the number of features is moderate enough that the uncorrected type-I error rate is acceptable.

Key properties:

Supervised: requires the target array y at fit time.
alpha is the significance threshold in [0, 1]; typical values are 0.05 or 0.10.
No correction for multiple comparisons: more liberal than FDR and FWE.
The number of retained features is data-driven and not fixed in advance.

Wraps scikit-learn's SelectFpr.

References

[1] https://scikit-learn.org/stable/modules/generated/sklearn.feature_selection.SelectFpr.html

Parameters

alpha : number, default=0.05: The highest p-value for features to be kept.

Methods

get_output_type(self, column_name: str = None) -> DashAI.back.types.dashai_data_type.DashAIDataType

Defined on SelectFpr

Return the DashAI data type produced by this converter for a column.

Parameters

column_name : str, optional: Not used; all output columns share the same type. Defaults to None.

Returns

DashAIDataType: A Float type backed by pyarrow.float64().

fit(self, x: 'DashAIDataset', y: Optional[ForwardRef('DashAIDataset')] = None) -> DashAI.back.converters.base_converter.BaseConverter

Defined on SklearnWrapper

Fit the scikit-learn transformer to the data.

Parameters

x : DashAIDataset: The input dataset to fit the transformer on.
y : DashAIDataset, optional: Target values for supervised transformers. Defaults to None.

Returns

BaseConverter: The fitted transformer instance (self).

get_metadata(cls) -> 'Dict[str, Any]'

Defined on BaseConverter

Get metadata for the converter, used by the DashAI frontend.

Parameters

cls : type: The converter class (injected automatically by Python for classmethods).

Returns

Dict[str, Any]: Dictionary containing display name, short description, image preview path, category, icon, color, and whether the converter is supervised.

get_schema(cls) -> dict

Defined on ConfigObject

Generates the component related Json Schema.

Returns

dict: Dictionary representing the Json Schema of the component.

transform(self, x: 'DashAIDataset', y: Optional[ForwardRef('DashAIDataset')] = None) -> 'DashAIDataset'

Defined on SklearnWrapper

Transform the data using the fitted scikit-learn transformer.

Parameters

x : DashAIDataset: The input dataset to transform.
y : DashAIDataset, optional: Not used. Present for API consistency. Defaults to None.

Returns

DashAIDataset: The transformed dataset with updated DashAI column types.

validate_and_transform(self, raw_data: dict) -> dict

Defined on ConfigObject

It takes the data given by the user to initialize the model and returns it with all the objects that the model needs to work.

Parameters

raw_data : dict: A dictionary with the data provided by the user to initialize the model.

Returns

dict: A validated dictionary with the necessary objects.

References​

Parameters​

Methods​

References

Parameters

Methods