Registry reference
Experiment components
The preferred way to set up an experiment is to use a YAML configuration. Use the LR system selection helper to learn how to use the YAML interface.
This page lists the components that may be needed to set up an experiment.
Experiment strategies
Registry section: experiment_strategies
experiment_strategies.single_run– Prepare Experiment consisting of a single run using configuration values.experiment_strategies.grid– Prepare Experiment consisting of multiple runs using configuration values.experiment_strategies.optuna– Prepare Experiment for optimizing configuration parameters.
Data strategies
Registry section: data_strategies
data_strategies.train_test– Split the data into a training set and a test set.data_strategies.cross_validation– K-fold cross-validation iterator over successive train/test splits.data_strategies.train_test_sources– Split the data into a training set and a test set by their source ids.data_strategies.cross_validation_sources– K-fold cross-validation by source id.data_strategies.predefined_train_test– Split data into a training set and a test set based on predefined assignments.data_strategies.predefined_cross_validation– Split data into cross validation folds based on predefined assignments.data_strategies.train_test_pairs– A train/test split policy for paired instances.
Data providers
Registry section: data_providers
data_providers.synthesized_normal_binary– Implementation of a data source generating normally distributed binary class data.data_providers.synthesized_normal_multiclass– Implementation of a data source generating normally distributed multiclass data.data_providers.glass– LA-ICP-MS measurements of elemental concentration from floatglass.data_providers.parse_features_from_csv_file– Read CSV data from file.data_providers.parse_features_from_csv_url– Read CSV data from a URL.
Metrics
Registry section: metric
metric.cllr– Calculate a log likelihood ratio cost (C_llr) for a series of log likelihood ratios.metric.cllr_min– Estimate the discriminative power from a collection of log likelihood ratios.metric.cllr_cal– Calculate the difference between the C_llr before and after isotonic calibration.metric.llr_lower_bound– Provide corresponding lower bound for provided LLR data.metric.llr_upper_bound– Provide corresponding upper bound for provided LLR data.metric.devpav– Calculate devPAV for LR data under H1 and H2.
Output
Registry section: output
output.metrics– Helper class to write aggregated results to CSV file.output.pav– Generate a plot of pre-calibrated versus post-calibrated LRs using Pool Adjacent Violators (PAV).output.ece– Generate an ECE plot for a set of LRs and corresponding ground-truth labels.output.lr_histogram– Plot the 10log LRs.output.llr_interval– Plot the LRs on the x-axis, with the relative interval score on the y-axis.output.llr_overestimation– Plot LLR-overestimation as a function of the system LLR.output.nbe– Generate the visual NBE plot using matplotlib.output.invariance_delta_function– Plot Invariance Verification delta functions and LR bounds.output.tippett– Plot empirical cumulative distribution functions of same-source and different-sources LRs.output.score_to_llr– Aggregation that generates plots by repeatedly calling a plotting function.output.score_distribution– Aggregation that generates plots by repeatedly calling a plotting function.output.save_model– Write the model to a file.output.case_llr– Aggregation that applies a full-data-fitted LR system to case data and stores LLRs as CSV.output.by_category– Aggregation method that manages data categorization.
Hyperparameters
Registry section: hyperparameter_types
hyperparameter_types.categorical– Parse a categorical hyperparameter from configuration.hyperparameter_types.cluster– Parse the configuration section of a clustered hyperparameter.hyperparameter_types.constant– Parse the configuration section of a constant.hyperparameter_types.float– Parse a floating-point hyperparameter from configuration.hyperparameter_types.folder– Parse a folder hyperparameter from configuration.
LR system components
You may choose to use either the Python API or a YAML configuration to set up an LR system.
Use the Practitioner’s Guide to learn how to use the Python API.
Use the LR system selection helper to learn how to use the YAML interface.
This page lists the components that may be needed to define an LR system.
LR system architecture
Registry section: lrsystem_architecture
lrsystem_architectures.specific_source– LR system for binary data and a linear pipeline.lrsystem_architectures.score_based– Provide a representation of a common source, score-based LR system.lrsystem_architectures.two_level– Implement two level model, common-source feature-based LR system architecture.
LR system modules
Registry section: modules
modules.pipeline– A pipeline of processing modules.modules.logging_pipeline– A pipeline that writes debugging output to a CSV file.modules.bootstrap– Bootstrap system that estimates confidence intervals around the best estimate of a pipeline.modules.standard_scaler– Standardize features by removing the mean and scaling to unit variance.modules.probabilities_to_odds– Convert a probability to odds.modules.probabilities_to_logodds– Convert probability values to their log odds with base 10.modules.element_wise_difference– Calculate the element-wise absolute difference between pairs.modules.euclidean_distance– Calculate the Euclidean distance between pairs.modules.manhattan_distance– Calculate the Manhattan distance between pairs.modules.tee– Parse configuration for allowing multiple tasks for given input.modules.csv_writer– Implementation of a transformation step in a scikit-learn Pipeline that writes to CSV.modules.save_features– Transformer to save the features of the instances in a new field.modules.validate_feature_data_type– Module that validates the data types of the features in the instances.modules.identity– Represent the Identity function of a transformer.modules.logistic_regression– Logistic Regression (aka logit, MaxEnt) classifier.modules.svm– C-Support Vector Classification.modules.kde– Calculate LR from a score, belonging to one of two distributions using KDE.modules.isotonic_calibrator– Calculate LR from a score belonging to one of two distributions using isotonic regression.modules.logistic_calibrator– Calculate LR from a score, belonging to one of two distributions using logistic regression.modules.mcmc– Use Markov Chain Monte Carlo simulations to fit a statistical distribution for each of the two hypotheses.modules.static_bounder– Bound LLRs to constant values.modules.elub_bounder– Calculate the Empirical Upper and Lower Bounds for a given LR system.modules.iv_bounder– Calculate Invariance Verification bounds for a given LR system.modules.n_source_bounder– Bound LLRs based on the number of sources.
Pairing methods
Registry section: pairing
pairing.instance_pairs– Construct pairs from a set of instances.pairing.source_pairs– Construct pairs of sources (i.e. classes) from an array of instances.