Papers Explored
Papers Explored by categories in reversed chronological order of publication. generated by jekyll-scholar.
2025
- International AI Safety ReportarXiv preprint arXiv:2501.17805, 2025
- On the Biology of a Large Language ModelTransformer Circuits Thread, 2025
- Deepseek-r1: Incentivizing reasoning capability in llms via reinforcement learningarXiv preprint arXiv:2501.12948, 2025
2024
- NEURAL NETWORK COMPRESSION: THE FUNCTIONAL PERSPECTIVEIn 5th Workshop on practical ML for limited/low resource settings, 2024
- Knowledge Distillation: The Functional PerspectiveIn NeurIPS 2024 Workshop on Scientific Methods for Understanding Deep Learning, 2024
- Deepseek-v2: A strong, economical, and efficient mixture-of-experts language modelarXiv preprint arXiv:2405.04434, 2024
- Deepseek-v3 technical reportarXiv preprint arXiv:2412.19437, 2024
- Better & faster large language models via multi-token predictionarXiv preprint arXiv:2404.19737, 2024
- Deepseekmath: Pushing the limits of mathematical reasoning in open language modelsarXiv preprint arXiv:2402.03300, 2024
2023
- Amortizing intractable inference in large language modelsarXiv preprint arXiv:2310.04363, 2023
- Gflownet foundationsJournal of Machine Learning Research, 2023
- Learning gflownets from partial episodes for improved convergence and stabilityIn International Conference on Machine Learning, 2023
2022
- Trajectory balance: Improved credit assignment in gflownetsAdvances in Neural Information Processing Systems, 2022