GLMNet

glmnet is an R package by Jerome Friedman, Trevor Hastie, Rob Tibshirani that fits entire Lasso or ElasticNet regularization paths for linear, logistic, multinomial, and Cox models using cyclic coordinate descent. This Julia package wraps the Fortran code from glmnet.

Quick start

To fit a basic regression model:

julia> using GLMNet

julia> srand(123)

julia> y = [1:100]+randn(100)*10;

julia> X = [1:100 (1:100)+randn(100)*5 (1:100)+randn(100)*10 (1:100)+randn(100)*20];

julia> path = glmnet(X, y)
Least Squares GLMNet Solution Path (74 solutions for 4 predictors in 832 passes):
74x3 DataFrame
| Row | df | pct_dev  | λ         |
|-----|----|----------|-----------|
| 1   | 0  | 0.0      | 29.6202   |
| 2   | 1  | 0.148535 | 26.9888   |
| 3   | 1  | 0.271851 | 24.5912   |
| 4   | 1  | 0.37423  | 22.4066   |
⋮
| 70  | 4  | 0.882033 | 0.0482735 |
| 71  | 4  | 0.882046 | 0.043985  |
| 72  | 4  | 0.882058 | 0.0400775 |
| 73  | 4  | 0.882067 | 0.0365171 |
| 74  | 4  | 0.882075 | 0.033273  |

path represents the Lasso or ElasticNet fits for varying values of λ. The value of the intercept for each λ value are in path.a0. The coefficients for each fit are stored in compressed form in path.betas.

julia> path.betas
4x74 CompressedPredictorMatrix:
 0.0  0.091158  0.174218  …   0.913497   0.915593   0.917647
 0.0  0.0       0.0           0.128054   0.127805   0.127568
 0.0  0.0       0.0          -0.126211  -0.128015  -0.129776
 0.0  0.0       0.0           0.108217   0.108254   0.108272

This CompressedPredictorMatrix can be indexed as any other AbstractMatrix, or converted to a Matrix using convert(Matrix, path.betas).

One can visualize the path by

julia> using Gadfly

julia> plot(path, Guide.xlabel("||β||₁"), Guide.ylabel("βᵢ"), x=:norm1)

One can see that the LASSO path is piecewise linear.

To predict the output for each model along the path for a given set of predictors, use predict:

julia> predict(path, [22 22+randn()*5 22+randn()*10 22+randn()*20])
1x74 Array{Float64,2}:
 50.8669  48.2689  45.9017  …  21.9344  21.9377  21.9407

To find the best value of λ by cross-validation, use glmnetcv:

julia> cv = glmnetcv(X, y)
Least Squares GLMNet Cross Validation
74 models for 4 predictors in 10 folds
Best λ 0.450 (mean loss 129.720, std 14.871)

julia> indmin(cv.meanloss)
46

julia> cv.path.betas[:, 46]
4-element Array{Float64,1}:
 0.781119
 0.128094
 0.0
 0.103008

julia> coef(cv)
4-element Array{Float64,1}:
 0.781119
 0.128094
 0.0
 0.103008

A classification Example

julia> using RDatasets

julia> iris = dataset("datasets", "iris");

julia> X = convert(Matrix, iris[:, 1:4]);

julia> y = convert(Vector, iris[:Species]);

julia> iTrain = sample(1:size(X,1), 100, replace = false);

julia> iTest = setdiff(1:size(X,1), iTrain);

julia> iris_cv = glmnetcv(X[iTrain, :], y[iTrain])
Multinomial GLMNet Cross Validation
100 models for 4 predictors in 10 folds
Best λ 0.001 (mean loss 0.124, std 0.046)

julia> yht = round(predict(iris_cv, X[iTest, :], outtype = :prob), 3);

julia> DataFrame(target=y[iTest], set=yht[:,1], ver=yht[:,2], vir=yht[:,3])[5:5:50,:]
10x4 DataFrame
| Row | target       | set   | ver   | vir   |
|-----|--------------|-------|-------|-------|
| 1   | "setosa"     | 0.999 | 0.001 | 0.0   |
| 2   | "setosa"     | 1.0   | 0.0   | 0.0   |
| 3   | "setosa"     | 1.0   | 0.0   | 0.0   |
| 4   | "versicolor" | 0.0   | 0.983 | 0.017 |
| 5   | "versicolor" | 0.002 | 0.961 | 0.037 |
| 6   | "versicolor" | 0.0   | 0.067 | 0.933 |
| 7   | "versicolor" | 0.0   | 0.993 | 0.007 |
| 8   | "virginica"  | 0.0   | 0.0   | 1.0   |
| 9   | "virginica"  | 0.0   | 0.397 | 0.603 |
| 10  | "virginica"  | 0.0   | 0.025 | 0.975 |

julia> plot(iris_cv.path, Scale.x_log10, Guide.xlabel("λ"), Guide.ylabel("βᵢ"))

julia> plot(iris_cv)

Fitting models

glmnet has two required parameters: the m x n predictor matrix X and the dependent variable y. It additionally accepts an optional third argument, family, which can be used to specify a generalized linear model. Currently, Normal() (least squares, default), Binomial() (logistic), Poisson() , Multinomial(), CoxPH() (Cox model) are supported.

For linear and Poisson models, y is a numerical vector.
For logistic models, y is either a string vector or a m x 2 matrix, where the first column is the count of negative responses for each row in X and the second column is the count of positive responses.
For multinomial models, y is etiher a string vector (with at least 3 unique values) or a m x k matrix, where k is number of unique values (classes).
For Cox models, y is a 2-column matrix, where the first column is survival time and second column is (right) censoring status. Indeed, For survival data, glmnet has another method glmnet(X::Matrix, time::Vector, status::Vector). Same for glmnetcv.

glmnet also accepts many optional keyword parameters, described below:

weights: A vector of weights for each sample of the same size as y.
alpha: The tradeoff between lasso and ridge regression. This defaults to 1.0, which specifies a lasso model.
penalty_factor: A vector of length n of penalties for each predictor in X. This defaults to all ones, which weights each predictor equally. To specify that a predictor should be unpenalized, set the corresponding entry to zero.
constraints: An n x 2 matrix specifying lower bounds (first column) and upper bounds (second column) on each predictor. By default, this is [-Inf Inf] for each predictor in X.
dfmax: The maximum number of predictors in the largest model.
pmax: The maximum number of predictors in any model.
nlambda: The number of values of λ along the path to consider.
lambda_min_ratio: The smallest λ value to consider, as a ratio of the value of λ that gives the null model (i.e., the model with only an intercept). If the number of observations exceeds the number of variables, this defaults to 0.0001, otherwise 0.01.
lambda: The λ values to consider. By default, this is determined from nlambda and lambda_min_ratio.
tol: Convergence criterion. Defaults to 1e-7.
standardize: Whether to standardize predictors so that they are in the same units. Defaults to true. Beta values are always presented on the original scale.
intercept: Whether to fit an intercept term. The intercept is always unpenalized. Defaults to true.
maxit: The maximum number of iterations of the cyclic coordinate descent algorithm. If convergence is not achieved, a warning is returned.

Name		Name	Last commit message	Last commit date
Latest commit History 102 Commits
deps		deps
src		src
test		test
.travis.yml		.travis.yml
LICENSE.md		LICENSE.md
README.md		README.md
REQUIRE		REQUIRE

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

deps

deps

src

src

test

test

.travis.yml

.travis.yml

LICENSE.md

LICENSE.md

README.md

README.md

REQUIRE

REQUIRE

Repository files navigation

GLMNet

Quick start

A classification Example

Fitting models

See also

About

Releases

Packages

Languages

License

linxihui/GLMNet.jl

Folders and files

Latest commit

History

Repository files navigation

GLMNet

Quick start

A classification Example

Fitting models

See also

About

Resources

License

Stars

Watchers

Forks

Languages