-
Notifications
You must be signed in to change notification settings - Fork 157
Description
-
Disintegration of MLJModels (medium)
Disintegration of MLJModels MLJModels.jl#244 : priority would be for GLM, with a blank repo at https://github.com/alan-turing-institute/MLJGLMInterface.jl ; you could use https://github.com/alan-turing-institute/MLJNaiveBayesInterface.jl as a template.done -
Universal transformer for wrapping univariate transformers (medium) https://github.com/alan-turing-institute/MLJModels.jl/issues/288 : more detailed design proposal needed. Familiarity with the logic of existing
Standardizer
helpful. This may already be a good template for what we want to do here (just replaceUnivariateStandardizer
by a user-specified one). Need to worry aboutinverse_transform
when implemented. -
Disintegration of MLJBase (medium) parts of Disintegration of MLJBase (discussion and tracking issue) MLJBase.jl#416 , in particular Serialization and OpenML, which seem to be hefty. Worth exploring which dependencies are causing most latency. Also, StatisticalMeasures (medium-long). Added note: Measures currently depend on UnivariateFinite, which in turn depends on Distributions, but only the base API. See this issue: Migrate UnivariateFinite (for categorical distributions) out to new package MLJBase.jl#504
-
pdfnorm for Distributions.jl (??) Feature request: provide L2 norm of each distribution's pdf JuliaStats/Distributions.jl#806: this is one I believe Mose discussed with @fkiraly but was not completed, in an earlier engagement. Would be good to know what the status of that work is.
-
investigate source of package compiler issues (medium) (MLJBase + PackageCompiler + Distributed => error MLJBase.jl#427). Suggest commenting out src/composition/ for a start.
-
[ ]
-
Review/Redesign of model registry (long) Review of model registry MLJModels.jl#321
-
Test new API proposal to improve data resampling performance (medium) taking performance issue more seriously MLJBase.jl#309 (comment)done -
Add visualisation to model tuning results (medium) Add plot recipe for visualising multiple hyperparameter tuning outcomes MLJTuning.jl#41
-
Populate model metadata with good default hyperparameter ranges (short-medium) Populate default hyperparameter ranges in the model metadata. MLJModels.jl#322
-
Allow use of sample and class weights in sk-learn models (medium) Add sample weights support for sk-learn models MLJScikitLearnInterface.jl#17 (and the related https://github.com/alan-turing-institute/MLJModels.jl/issues/127)
-
Add control over logging level (short) [FR] allow disabling of logging #255
added mid November
-
cleanup of measures (short) name, aliases cleanup for measures MLJBase.jl#450done
added early January 2020
-
roll out data front-ends for models (medium) Implement the optional data front-end that models will be able to implement after Realize performance improvements for models implementing new data front-end MLJBase.jl#501 .
-
TLC for DataScienceTutorials (short - medium) The tutorials need updating to latest version of MLJ, and some contributors have made PR's that are languishing.