asco/static/about.html
2024-12-23 10:02:39 -05:00

29 lines
873 B
HTML

<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<link rel="stylesheet" href="main.css">
<link rel="stylesheet" href="theme.css">
<title>About</title>
</head>
<body>
<center><h1>About</h1></center>
<h2>The Data</h2>
<p>
Information was scraped from <a href="//ascopubs.org/jco/meeting">this page</a> on 2023-06-28.
Over 5000 abstract titles from 2023 were used as training data.
Embeddings for these were generated through the OpenAI API, with the <code>text-embedding-ada-002</code> model.
</p>
<h2>The Model</h2>
<p>
A penalized logistic regression model was fit using the <code>glmnet</code> R package.
The tuning parameter was selected using cross validation.
The area under the ROC curve was 0.83 in the training data.
</p>
<center><a href="..">back</a></center>
</body>
</html>