Developed and deployed multiple AIGC models for advertising creative generation, achieving an average +14.44% improvement in offline quality metrics, a 2.53x QPS increase, and online gains of +0.56% CTR, +0.27% CPM1, and +1.10% cost efficiency.
Designed and implemented a multi-source LLM distillation strategy, generating over 800,000 high-quality training samples through multi-stage filtering and online data feedback.
Optimized model training with SFT cold-start and DPO alignment, boosting model output diversity by +2.78% and rejection recall to 96.18% for online service safety.
Spearheaded research into black-box reward modeling and RL optimization for LLMs, achieving a new Pareto front with +2.36% average quality improvement in offline evaluations.
Optimized search ad fine-grained embedding models, boosting clustering advantage by +10.37%, NER F1 by +9.56%, expanding long-tail PV coverage by +3.98%, and achieving an 82.7% topic adoption rate.