Feature request: allow some form of regularisation in multinomial fits?

More as a feature request: do you think it might be possible to allow some form of regularisation in `mblogit` multinomial fits?
When one is dealing with sparse data, one now often encounters problems of complete separation, resulting in very large errors. This could be resolved if some form of regularisation could be added. I understand that for logistic regression, a ridge (L2 norm) penalty on the coefficients can be added just by changing the IRLS algorithm ([https://bwlewis.github.io/GLM/](https://bwlewis.github.io/GLM/)) so that in the least square regression step of the IRLS algorithm it uses a ridge regression instead of a regular OLS regression. This can be done just by augmenting the covariate matrix X with a p x p covariate matrix with p = nr of columns of the covariate matrix and sqrt(lambdas) along the diagonal and augmenting the outcome variable with p zeros (with lambdas=a vector of L2 norm penalty coefficients=lambda*penalty weights, with vector penalty weights typically being set to 0 for the intercept and to 1 for the rest, or in case of adaptive ridge regression to 1/(abs(coef)) or 1/(coef^2), where coef are the coefficients obtained e.g. in an earlier ridge regression) (see [https://stats.stackexchange.com/questions/69205/how-to-derive-the-ridge-regression-solution](https://stats.stackexchange.com/questions/69205/how-to-derive-the-ridge-regression-solution) and [https://stats.stackexchange.com/questions/137057/ridge-penalized-glms-using-row-augmentation](https://stats.stackexchange.com/questions/137057/ridge-penalized-glms-using-row-augmentation)). I presume the same recipe could be applied for a multinomial regression, in which case incorporating such a feature might not be too hard - in that case it could just be a matter of changing line 37 in https://github.com/melff/mclogit/blob/master/pkg/R/mclogit-fit.R to a weighted least square ridge regression. 

For example, in a multinomial model looking at SARS-CoV2 lineage dynamics through time in different countries & continents
`lineage ~ date+date:continent+date:country+country`
it would be nice if I could penalize just the date:country logistic slopes, and leave all the intercepts & the per-continent logistic slopes unpenalized, in order to shrink the per-country logistic slopes a little towards the ones of the continent where the country is in.

Do you think such a feature could be incorporated in the future?

I saw that package MGLM tried to incorporate LASSO penalties for multinomial models in their function
[https://www.rdocumentation.org/packages/MGLM/versions/0.2.1/topics/MGLMsparsereg](https://www.rdocumentation.org/packages/MGLM/versions/0.2.1/topics/MGLMsparsereg)
but I had no luck getting it to work (and it also didn't provide a `vcov` method).
And glmnet of course allows for elastic net regularised multinomial models (but that one doesn't return the variance-covariance matrix and doesn't have a formula interface).

In terms of regularised (L1/LASSO or L2/ridge penalized) multinomial models I also saw this one,
[https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=1424458](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=1424458). Though I believe the variance-covariance matrix only has a closed-form solution for the ridge penalized case.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature request: allow some form of regularisation in multinomial fits? #30

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Feature request: allow some form of regularisation in multinomial fits? #30

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions