Skip to content

Commit 02c053b

Browse files
committed
Add ML Assignment Files
Add ML Assignment Files
1 parent b17a1fe commit 02c053b

File tree

3 files changed

+25
-0
lines changed

3 files changed

+25
-0
lines changed

.DS_Store

0 Bytes
Binary file not shown.
Lines changed: 25 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,25 @@
1+
---
2+
title: "20240828_ML_Class_Exercises"
3+
output: pdf_document
4+
date: "2024-08-28"
5+
---
6+
7+
1) The CARET package provides a built cell segmentation data set called `segmentationData`. To initialize access to the data, you can simply load the CARET package and call data(segmentationData). The loaded data frame will contain 119 imaging measurements on 2019 individual cells taken from a high content screening assay. The team that generated this data set sought to determine if the cells in the assay were segmented well or poorly.
8+
9+
Read in this data and report how many outcome variables exist in the "Class" variable.
10+
```{r setup}
11+
12+
```
13+
2) Now that you have determined how many outcomes are possible, split your data into 2/3 training data and 1/3 testing data. Make sure you conduct preprocessing before splitting the data! Note: the authors of the publication associated with this data provided a column called "Case" to this data frame indicating they used 50% of their data for training and the other 50% for testing. Remove this column before you conduct downstream processing.
14+
```{r}
15+
16+
```
17+
3) Build an elastic net model that leverages recursive feature elimination to select a subset of important features to be included in your model. Use bootstrapping for cross validation.
18+
```{r}
19+
20+
```
21+
4) Use your new model to predict whether a cell is poorly or well segmented in your testing data set.
22+
```{r}
23+
24+
```
25+

0 commit comments

Comments
 (0)