Commit Graph

68 Commits

Author SHA1 Message Date
Andrej Karpathy 0f21e69125 Merge branch 'ajdintrejic-master' 2021-12-18 15:06:36 -08:00
Andrej Karpathy 68e4e0f3bd Merge branch 'master' of https://github.com/ajdintrejic/arxiv-sanity-lite into ajdintrejic-master 2021-12-18 15:01:40 -08:00
Andrej Karpathy 6e85778b24 change the way we recommend papers. we'll do it per tag instead of just mixing everything together. i believe this should produce higher quality results 2021-12-18 14:53:57 -08:00
Ajdin Trejic 7a438d9c46 Made site a bit more mobile friendly 2021-12-18 23:30:40 +01:00
Andrej Karpathy 65d4bb8415 some sleep is good for you 2021-12-10 20:28:34 -08:00
Andrej Karpathy cd2a88f375 when a user is logged in and hits the default endpoint '/' show recommendations over the last week instead of just totally random recent papers, by default 2021-12-10 09:15:29 -08:00
Andrej Karpathy 33b2b018ab kind of big changes here: we can now inspect & see the most positive and negative words for a trained svm, to help tune the value C. then there is also the UI for setting value C in the SVM. Finally the value of C I adjusted to default to 0.01 (was 0.1 before) because the results and the weights look more sensible based on manual inspection. We need some dataset of people libraries in order to potentially cross-validate a good value C automatically. For now there are not enough active users of the site that such a thing could be attempted and succeed. Doing my best now just by eyeballing 2021-12-08 00:17:37 -08:00
Andrej Karpathy 6be574a0fe hah don't crash if a user submits an invalid query, better to return empty results. in the future we may want a helpful error message too 2021-12-07 22:56:09 -08:00
Andrej Karpathy 4ec9a75293 add a secret GET argument svm_c that changes the C value in the SVM. no UI for this yet :) 2021-12-07 22:52:13 -08:00
Andrej Karpathy 6308e0cfa4 list out the tags i think 2021-12-07 22:33:50 -08:00
Andrej Karpathy 28163bfcb7 adding the user account into the email, ty @Randl for the suggestion. I think this should be ok to include in plain text here because it is an email direct to the user 2021-12-07 22:27:14 -08:00
Andrej Karpathy 1153a70563 print the total number of emails sent 2021-12-04 00:39:55 -08:00
Andrej Karpathy 6df5f1d178 add requirements.txt ty @Prakyathkantharaju and @e-tornike for help 2021-12-02 16:08:05 -08:00
Andrej Karpathy 31e02f267c handle the edge case of zero positives... sigh 2021-11-30 23:34:47 -08:00
Andrej Karpathy fe96775e6e fix issues and improve logic of removing tags 2021-11-30 22:57:59 -08:00
Andrej Karpathy 9c00c8dacd disallow the special tag null, which can result if user escapes the prompt 2021-11-30 22:51:16 -08:00
Andrej Karpathy a01c637cc8 tweak email sending 2021-11-29 20:45:24 -08:00
Andrej Karpathy d5b91270a9 allow to use fewer documents for training tfidf features to prevent OOMs 2021-11-29 15:38:36 -08:00
Andrej Karpathy e182dda381 make send_emails into a proper script and add a few more widgets 2021-11-29 09:29:59 -08:00
Andrej Karpathy 6d7f648384 visually distinguish the special 'all' tag 2021-11-28 08:23:51 -08:00
Andrej Karpathy 8947dd3fc5 tweak search scoring for better results, i think 2021-11-27 18:09:17 -08:00
Andrej Karpathy aff7c89695 use the older arxiv-sanity colorscheme for header 2021-11-27 17:15:40 -08:00
Andrej Karpathy a83dbb2138 it doesn't align, the ocd is killing me 2021-11-27 14:53:14 -08:00
Andrej Karpathy 5990a938a4 add few more stats 2021-11-27 14:47:28 -08:00
Andrej Karpathy 1769b7b02f minor profile page html/css cleanup 2021-11-27 14:31:18 -08:00
Andrej Karpathy d236a5dff9 small tweaks to not do useless work in special cases 2021-11-27 13:06:00 -08:00
Andrej Karpathy a169f0fa61 adding an about page as well 2021-11-27 11:47:17 -08:00
Andrej Karpathy 6c87055349 slight readme tweak 2021-11-27 11:41:04 -08:00
Andrej Karpathy 6d4d1eb78d tweak readme 2021-11-27 11:38:31 -08:00
Andrej Karpathy 6685b0a05f i think this detail matters... 2021-11-27 11:23:35 -08:00
Andrej Karpathy 5eece3f992 add email sending script to repo 2021-11-27 11:19:55 -08:00
Andrej Karpathy 49995465df add ability to store an email for a user for recommendations 2021-11-27 11:04:36 -08:00
Andrej Karpathy 970a9e24cf maintain last active state for users 2021-11-27 09:37:02 -08:00
Andrej Karpathy 47fba66712 add a new stats page 2021-11-26 21:57:48 -08:00
Andrej Karpathy 6a0ff6a9bf clarify even further 2021-11-26 21:10:22 -08:00
Andrej Karpathy b52b64e0f7 don't show the +/- controls for tags when a person is not logged in, it's confusing 2021-11-26 21:09:10 -08:00
Andrej Karpathy e4fe77d118 show user warning if they are not logged in that things won't work 2021-11-26 20:57:20 -08:00
Andrej Karpathy c3ce3e3f70 update the screenshot since the interface changed quite a bit 2021-11-26 20:33:10 -08:00
Andrej Karpathy 4538eedb84 update the readme 2021-11-26 20:31:57 -08:00
Andrej Karpathy 9c88a4eea4 use the process exit code to communicate whether any updates successfully made it into the database at all 2021-11-26 20:19:48 -08:00
Andrej Karpathy d2582c5b77 if the very first API call fails it is very unlikely that there are new papers at all, no need to try multiple pages 2021-11-26 20:10:32 -08:00
Andrej Karpathy aa877c9397 when writing features do it safely and atomically 2021-11-26 20:00:37 -08:00
Andrej Karpathy c3161b2a49 do not reveal username since they are kind of secret now 2021-11-26 17:11:19 -08:00
Andrej Karpathy 5b5b503a93 let things breathe a bit more 2021-11-26 16:44:21 -08:00
Andrej Karpathy c5b5c7833c and i think that's it, we now support user accounts (lite)! sweet. 2021-11-26 16:38:36 -08:00
Andrej Karpathy 54157c6e9c ok here we go, the beginning of user profiles. for now we assume there is a single hard-coded user root 2021-11-26 11:07:16 -08:00
Andrej Karpathy dc55146e03 few notes on some outstanding todos 2021-11-26 10:28:49 -08:00
Andrej Karpathy 3e2d1248d8 speed up time filtering by 20X by loading all of metas into memory once instead of querying it item by item in a massive loop 2021-11-26 10:22:52 -08:00
Andrej Karpathy 3d5bbb9851 optimization: read only metas if that's all that we need to complete the request. that said, i expected this to run much faster and it doesn't. i think the way i'm using sqlites here is not exactly efficient or appropriate and i have to re-think the database schema. e.g. getting random papers over the last 7 days should be almost instantaneous, but currently this request takes like 3 seconds. have to better understand why this is happening 2021-11-26 09:51:48 -08:00
Andrej Karpathy b283c85c72 this was bothering me 2021-11-26 09:36:09 -08:00
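
Commit aa877c9397 ("when writing features do it safely and atomically") names a technique without showing it. One common way to do this is sketched below, assuming the features are stored as a pickle file. This is not the repository's actual code: the function name safe_pickle_dump, the file name features.p, and the choice of pickle are all illustrative assumptions. The data is written to a temporary file in the same directory and then renamed over the target, so a reader sees either the old file or the new one, never a half-written one.

```python
import os
import pickle
import tempfile


def safe_pickle_dump(obj, path):
    """Write obj to path so readers never observe a partially written file.

    Hypothetical helper for illustration; not taken from the repository.
    """
    directory = os.path.dirname(os.path.abspath(path))
    # Create the temp file next to the destination so the final rename
    # stays on one filesystem, which is what makes the replace atomic.
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    try:
        with os.fdopen(fd, "wb") as f:
            pickle.dump(obj, f)
            f.flush()
            os.fsync(f.fileno())  # push the bytes to disk before renaming
        # os.replace overwrites the destination in a single step.
        os.replace(tmp_path, path)
    except BaseException:
        # On any failure, remove the temp file and leave the old file untouched.
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise
```

A caller would use it as safe_pickle_dump(features, "features.p"). If the process dies mid-write, the previous features.p is still intact and only a stray .tmp file may be left behind.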