29 lines
873 B
HTML
29 lines
873 B
HTML
<!DOCTYPE html>
|
|
<html lang="en">
|
|
<head>
|
|
<meta charset="UTF-8">
|
|
<link rel="stylesheet" href="main.css">
|
|
<link rel="stylesheet" href="theme.css">
|
|
<title>About</title>
|
|
</head>
|
|
<body>
|
|
<center><h1>About</h1></center>
|
|
|
|
<h2>The Data</h2>
|
|
<p>
|
|
Information was scraped from <a href="//ascopubs.org/jco/meeting">this page</a> on 2023-06-28.
|
|
Over 5000 abstract titles from 2023 were used as training data.
|
|
Embeddings for these were generated through the OpenAI API, with the <code>text-embedding-ada-002</code> model.
|
|
</p>
|
|
|
|
<h2>The Model</h2>
|
|
<p>
|
|
A penalized logistic regression model was fit using the <code>glmnet</code> R package.
|
|
The tuning parameter was selected using cross validation.
|
|
The area under the ROC curve was 0.83 in the training data.
|
|
</p>
|
|
|
|
<center><a href="..">back</a></center>
|
|
</body>
|
|
</html>
|