Commit Graph

71 Commits

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Andrej Karpathy | 3e2d1248d8 | speed up time filtering by 20X by loading all of metas into memory once instead of querying it item by item in a massive loop | 2021-11-26 10:22:52 -08:00 |
| Andrej Karpathy | 3d5bbb9851 | optimization: read only metas if that's all that we need to complete the request. that said, i expected this to run much faster and it doesn't. i think the way i'm using sqlites here is not exactly efficient or appropriate and i have to re-think the database schema. e.g. getting random papers over the last 7 days should be almost instantaneous, but currently this request takes like 3 seconds. have to better understand why this is happening | 2021-11-26 09:51:48 -08:00 |
| Andrej Karpathy | b283c85c72 | this was bothering me | 2021-11-26 09:36:09 -08:00 |
| Andrej Karpathy | 8c74150337 | eliminate a whole spurious endpoint of /search and remove code duplication | 2021-11-26 09:31:06 -08:00 |
| Andrej Karpathy | 470bd00563 | refactor and clean up the code in preparation for some consolidation, coming up | 2021-11-26 09:10:35 -08:00 |
| Andrej Karpathy | f565eba1c7 | use jinja templates so there's no copy paste code all over the place | 2021-11-25 14:45:08 -08:00 |
| Andrej Karpathy | 46f556caf6 | example makefile | 2021-11-25 13:51:52 -08:00 |
| Andrej Karpathy | 1ed6e3f1b0 | ok now we can sequester all the database files into data/ folder so everything is nice and clean yay | 2021-11-25 13:47:45 -08:00 |
| Andrej Karpathy | 77279e1777 | sequester all file sytem IO ops only to db.py, so it's not total chaos | 2021-11-25 13:28:04 -08:00 |
| Andrej Karpathy | bd11865a49 | add an option to break out early when we've pulled in all new papers most likely | 2021-11-24 09:15:36 -08:00 |
| Andrej Karpathy | cf1bef6f53 | big new feature: ability to inspect any paper to see the raw tfidf tokens and their weights that summarize the paper, and which powers the SVM recommendation engine. basically a bit of a debugging / insight feature, but a really good sanity check that papers are being properly represented | 2021-11-21 20:51:01 -08:00 |
| Andrej Karpathy | e5798ddb2f | make the header link to root of the site as the case should always be ok cool | 2021-11-21 14:03:17 -08:00 |
| Andrej Karpathy | 548ee210df | better default parameters, based on qualitative inspection of tfidf features and word vectors | 2021-11-21 13:46:14 -08:00 |
| Andrej Karpathy | 8be24dae43 | allow for full sorting, slice and dicing with all options in the UI, and the previous shortcuts. now have a lot more flexibility to slice and dice as desired | 2021-11-16 21:39:43 -08:00 |
| Andrej Karpathy | abb83effed | add ability to remove a tag from a paper as well | 2021-11-14 21:49:34 -08:00 |
| Andrej Karpathy | 7cbb90a480 | minor logging changes | 2021-11-12 22:49:29 -08:00 |
| Andrej Karpathy | 158ced647b | add a screenshot and rearrange the readme a bit | 2021-11-12 21:36:45 -08:00 |
| Andrej Karpathy | f02766be2c | oops need to open for writing if changing the tags dict | 2021-11-12 21:23:22 -08:00 |
| Andrej Karpathy | 13a1d5ff48 | sequester gross details about database instantiation in the filesystem away from the scripts | 2021-11-12 21:12:09 -08:00 |
| Andrej Karpathy | 194b7f4b22 | first leet codes | 2021-11-12 20:40:19 -08:00 |
| Andrej | 089adab199 | Initial commit | 2021-11-12 20:34:22 -08:00 |
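
Commit 3e2d1248d8 in the log above describes speeding up time filtering by loading all of the metadata into memory once instead of querying it item by item in a loop. The sketch below illustrates that general idea only; it is not the repository's code. `MetaStore`, the `"time"` field, `recent_item_by_item` and `recent_bulk` are hypothetical names, and the store is a plain in-memory stand-in that counts reads so the difference in access pattern is visible.

```python
DAY = 86400  # seconds per day


class MetaStore:
    """Hypothetical stand-in for a key-value metadata store (paper id -> dict).

    It counts read operations so the two access patterns can be compared.
    """

    def __init__(self, data):
        self._data = dict(data)
        self.lookups = 0

    def __getitem__(self, key):
        self.lookups += 1  # one read per item
        return self._data[key]

    def keys(self):
        return list(self._data.keys())

    def items(self):
        self.lookups += 1  # one bulk read for everything
        return list(self._data.items())


def recent_item_by_item(store, pids, now, days):
    """Filter by time with one store read per paper id."""
    cutoff = now - days * DAY
    return [pid for pid in pids if store[pid]["time"] >= cutoff]


def recent_bulk(store, pids, now, days):
    """Filter by time after loading all metadata into a dict once."""
    cutoff = now - days * DAY
    metas = dict(store.items())  # single bulk read, then pure in-memory lookups
    return [pid for pid in pids if metas[pid]["time"] >= cutoff]


if __name__ == "__main__":
    store = MetaStore({f"p{i}": {"time": i * DAY} for i in range(10)})
    pids = store.keys()
    print(recent_item_by_item(store, pids, now=9 * DAY, days=3), store.lookups)
    store.lookups = 0
    print(recent_bulk(store, pids, now=9 * DAY, days=3), store.lookups)
```

Both functions return the same ids; the bulk version issues a single read against the store regardless of how many ids are filtered. Whether that yields a speedup in practice depends on how expensive each individual read is and on whether the full metadata fits comfortably in memory.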