Adding your own models to the benchmark
To run your own models,
implement an
AlgInterfacesubclass. There are numerous examples already implemented. For models that can only run a single train-validation-test split at a time, you might want to subclass or modifySklearnSubSplitInterfacefrompytabkit/models/alg_interfaces/sub_split_interfaces.py. Examples can be found inpytabkit/models/alg_interfaces/other_interfaces.pyorpytabkit/models/alg_interfaces/rtdl_interfaces.py.add an
AlgInterfaceWrappersubclass. This is often just a three-liner that specifies which AlgInterfaces subclass to instantiate. See the numerous examples inpytabkit/bench/alg_wrappers/interface_wrappers.py, especially the later ones.adjust the code to run your
AlgInterfaceWrapperon the benchmark, seescripts/run_experiments.pyfor many examples. Note thatRunConfighas an option to save the model predictions on the whole datasets, which can significantly increase the disk usage (can be up to 2 GB per model on the meta-test-class benchmark).