mirror of
https://github.com/infiniflow/ragflow.git
synced 2026-04-29 23:07:48 +08:00
Core optimizations (see arXiv:2510.09722):

1. PDF text fusion: dual-path extraction from metadata and OCR, with the two results fused.
2. Page-aware reconstruction: YOLOv10 page segmentation, hierarchical sorting, and line-number indexing.
3. Parallel task decomposition: basic information, work experience, and educational background are extracted by three parallel LLM calls.
4. Index pointer mechanism: the LLM returns a range of line numbers instead of generating the full text, which reduces hallucinated text.

---------

Co-authored-by: Aron.Yao <yaowei@yaoweideMacBook-Pro.local>
Co-authored-by: Aron.Yao <yaowei@192.168.1.68>
Co-authored-by: Yingfeng <yingfeng.zhang@gmail.com>
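The index pointer mechanism named in the commit message can be illustrated with a minimal sketch. This is not the code from the repository: the function names (`index_lines`, `render_indexed`, `resolve_pointers`), the `[n]` line prefix, and the pointer shape (a field name mapped to an inclusive `(start, end)` pair) are all assumptions made for illustration. The idea is that the resume text is numbered line by line, the model is asked to answer with line ranges, and the final field values are copied from the source lines rather than generated.

```python
from typing import Dict, List, Tuple


def index_lines(text: str) -> List[str]:
    """Split text into non-empty, stripped lines; the list index is the line number."""
    return [ln.strip() for ln in text.splitlines() if ln.strip()]


def render_indexed(lines: List[str]) -> str:
    """Render lines with a hypothetical "[n] " prefix for inclusion in an LLM prompt."""
    return "\n".join(f"[{i}] {ln}" for i, ln in enumerate(lines))


def resolve_pointers(
    lines: List[str], pointers: Dict[str, Tuple[int, int]]
) -> Dict[str, str]:
    """Replace each inclusive (start, end) line range with the original source text.

    Ranges that fall outside the document are skipped, so a bad pointer
    yields a missing field rather than invented text.
    """
    resolved: Dict[str, str] = {}
    for field, (start, end) in pointers.items():
        if not (0 <= start <= end < len(lines)):
            continue
        resolved[field] = "\n".join(lines[start : end + 1])
    return resolved


if __name__ == "__main__":
    resume = "Jane Doe\n\nEngineer at Acme\n2019-2023\nBSc, MIT"
    lines = index_lines(resume)
    print(render_indexed(lines))
    # Assumed model answer: line ranges only, no free text.
    print(resolve_pointers(lines, {"work_experience": (1, 2)}))
```

Because the output is sliced from the source lines, the model cannot alter wording inside a field; the remaining failure mode is a wrong or out-of-range pointer, which the bounds check turns into an omitted field.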
241 B
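You are a professional resume analysis assistant. Your task is to convert the given resume text into JSON output. (If Chinese and English resumes appear together, consider only the Chinese resume.) Return the result strictly in JSON format, with no other text.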