asco/static/about.html

28 lines
829 B
HTML
Raw Normal View History

<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<link rel="stylesheet" href="main.css">
<title>About</title>
</head>
<body>
<center><h1>About</h1></center>
<h2>The Data</h2>
<p>
Information was scraped from <a href="//ascopubs.org/jco/meeting">this page</a> on 2023-06-28.
Over 5000 abstract titles from 2023 were used as training data.
Embeddings for these were generated through the OpenAI API, with the <code>text-embedding-ada-002</code> model.
</p>
<h2>The Model</h2>
<p>
A penalized logistic regression model was fit using the <code>glmnet</code> R package.
The tuning parameter was selected using cross validation.
The area under the ROC curve was 0.83 in the training data.
</p>
<center><a href="..">back</a></center>
</body>
</html>