科研经历简介
关于大学三年科研经历的总结
Proximal Policy Optimization(PPO)
ChatGPT进行RLHF的核心方法!简要推导一下该模型
Post Tuned Hashing - A New Approach to Indexing High-dimensional Data
PTH,一个新的步骤,重建二值空间!
Revisiting Weak-to-Strong Consistency in Semi-Supervised Semantic Segmentation
FixMatch即可接近SOTA!那我要是优化一下思路呢?
Contrastive Learning based Vision-Language Pre-Training
基于对比学习的多模态预训练方法,以Vision-Language PTMs为例
Large-Language-Model For Math
让LLM大模型解决数学问题! -- \#TODO
Image2Poem
此情此景,何不吟诗一首?Image-to-Poem帮你完成!