ragflow/rag/prompts/resume_project_exp_en.md
Yao Wei f8c91e8854 Refa: Resume parsing module (architectural optimizations based on SmartResume Pipeline) (#13255)
Core optimizations (refer to arXiv:2510.09722):

1. PDF text fusion: Metadata + OCR dual-path extraction and fusion

2. Page-aware reconstruction: YOLOv10 page segmentation + hierarchical
sorting + line number indexing

3. Parallel task decomposition: three parallel LLM extraction tasks for
basic information, work experience, and educational background

4. Index pointer mechanism: LLM returns a range of line numbers instead
of generating the full text, reducing hallucination in the extracted text.

---------

Co-authored-by: Aron.Yao <yaowei@yaoweideMacBook-Pro.local>
Co-authored-by: Aron.Yao <yaowei@192.168.1.68>
Co-authored-by: Yingfeng <yingfeng.zhang@gmail.com>
2026-03-02 19:05:50 +08:00


Please extract project experience from the following line-indexed resume text.

{indexed_text}

Extract into JSON, each project experience entry contains:

{{
  "projectExperience": [
    {{
      "project_name": "",
      "role": "",
      "start_date": "",
      "end_date": "",
      "desc_lines": [start_index, end_index]
    }}
  ]
}}

Field descriptions:

  • project_name: Project name
  • role: Role/responsibility, e.g. "Project Lead", "Backend Developer"
  • start_date: Start date, format %Y.%m or %Y
  • end_date: End date, use "Present" if ongoing, "" if not available
  • desc_lines: [start_index, end_index], line number range for project description (integer array)
    • Covers the lines of the original text that describe the project, including project content, tech stack, achievements, etc.
    • Does not include lines containing project_name, role, start_date, end_date
    • Include as much as possible until the next project experience entry or other section heading
    • STOP before these section headings (do not include them in desc_lines): Self-evaluation, Personal Summary, Skills, Technical Skills, Education, Work Experience, Certificates, Languages, Hobbies, Career Objective
    • Use [] if not available

Return JSON only. /no_think
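
A caller might fill this template and check the model's reply along the following lines. This is a hedged sketch: the doubled braces suggest the template is filled with Python's `str.format`, but the calling code is not shown here, so that is an assumption. `TEMPLATE` is an abbreviated stand-in for the prompt, and `valid_date` and `valid_desc_lines` are hypothetical helpers that assume 0-based line indices and allow an empty string for either date.

```python
import json
import re

# Abbreviated stand-in for the prompt above; only the {indexed_text}
# placeholder and the escaped {{ }} braces matter for this sketch.
TEMPLATE = (
    "Please extract project experience from the following line-indexed resume text.\n\n"
    "{indexed_text}\n\n"
    'Extract into JSON: {{ "projectExperience": [ {{ "project_name": "", "desc_lines": [] }} ] }}\n\n'
    "Return JSON only. /no_think"
)

DATE_RE = re.compile(r"\d{4}(\.\d{1,2})?")  # %Y or %Y.%m


def valid_date(value: str, allow_present: bool = False) -> bool:
    """Accept "", optionally "Present", or a %Y / %Y.%m date string."""
    if value == "" or (allow_present and value == "Present"):
        return True
    return DATE_RE.fullmatch(value) is not None


def valid_desc_lines(desc_lines, line_count: int) -> bool:
    """Accept [] or an in-bounds [start, end] pair of integers."""
    if desc_lines == []:
        return True
    return (
        isinstance(desc_lines, list)
        and len(desc_lines) == 2
        and all(isinstance(i, int) for i in desc_lines)
        and 0 <= desc_lines[0] <= desc_lines[1] < line_count
    )


# str.format substitutes {indexed_text} and collapses {{ }} to literal braces.
prompt = TEMPLATE.format(indexed_text="[0] Project: Search Engine\n[1] Built an inverted index.")

# Example reply, written by hand for illustration (not real model output).
reply = (
    '{"projectExperience": [{"project_name": "Search Engine", "role": "", '
    '"start_date": "2021.03", "end_date": "Present", "desc_lines": [1, 1]}]}'
)
entries = json.loads(reply)["projectExperience"]
ok = all(
    valid_date(e["start_date"])
    and valid_date(e["end_date"], allow_present=True)
    and valid_desc_lines(e["desc_lines"], line_count=2)
    for e in entries
)
```

Validating `desc_lines` before resolving it guards against out-of-range or reversed pointers, which a line-range reply makes cheap to detect.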