ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-05-23 09:28:06 +08:00

Files

Ahmad Intisar 3c4d1da98f Feature/table parser column roles (#13710 )

### What problem does this PR solve?

The table file parser (CSV/Excel) currently treats all columns
identically — every column is both vectorized (embedded in chunk text)
and stored as filterable metadata. There's no way for users to control
which columns should be searchable by semantic meaning versus which
should only be filterable attributes.

For example, when ingesting a news articles CSV with columns like title,
content, country, category, source, etc., the embedding includes
metadata fields like country: Brazil and source: Reuters in the chunk
text, which dilutes the semantic quality of the embedding without adding
retrieval value.

The RDBMS connector (MySQL/PostgreSQL) already supports content_columns
/ metadata_columns, but this capability was missing for file-based table
ingestion.

This PR adds column-level control (vectorize / metadata / both) for the
table file parser, following RAGFlow's existing patterns.

Backward compatible: Datasets without table_column_roles or with
table_column_mode: auto behave exactly as before (all columns = both).

### Type of change

- [x] New Feature (non-breaking change which adds functionality)

2026-05-11 10:06:04 +08:00

.agents/skills/tanstack-query-best-practices

Fix: The dataset on the search page is not displaying the required field error message. (#14041 )

2026-04-10 18:20:50 +08:00

.husky

feat: format code before submitting it #1251 (#1252 )

2024-06-24 14:48:21 +08:00

.storybook

feat(storybook): Storybook with Calendar and Modal components #9869 (#10626 )

2025-10-17 09:58:52 +08:00

public

Fix: replace session page icons and fix nested list search functionality in filters (#13127 )

2026-02-12 19:48:35 +08:00

src

Feature/table parser column roles (#13710 )

2026-05-11 10:06:04 +08:00

.env

Feat: Place the language configuration in web/.env for easy user configuration. (#13920 )

2026-04-03 16:50:18 +08:00

.env.development

Feat: add memory function by go (#13754 )

2026-03-27 09:49:50 +08:00

.env.production

Feat: add skills space to context engine (#13908 )

2026-04-30 12:36:03 +08:00

.eslintrc.cjs

Feat: Add the user_id field to the agent log table and the embedded page. (#13596 )

2026-03-13 19:06:18 +08:00

.gitignore

Feat: Use storybook to display public components. #9914 (#9915 )

2025-09-04 17:03:36 +08:00

.npmrc

Fix: Limit node version #3547 (#3563 )

2024-11-21 18:14:22 +08:00

.prettierignore

feat: install prettier to format code and add react-dev-inspector to locate code in the IDE faster (#44 )

2024-01-29 15:02:27 +08:00

.prettierrc

Feat: Add background to next login page #3221 (#4474 )

2025-01-14 13:43:18 +08:00

CLAUDE.md

chore(CLAUDE.md): add shared UI component lock convention to CLAUDE.md (#14381 )

2026-04-27 12:03:32 +08:00

externals.d.ts

fix: cannot save the system model setting #468 (#508 )

2024-04-23 17:46:56 +08:00

index.html

Refactor: UmiJs -> Vite (#12410 )

2026-01-04 19:14:20 +08:00

jest-setup.ts

feat: test buildNodesAndEdgesFromDSLComponents (#940 )

2024-05-27 19:35:14 +08:00

jest.config.ts

feat: test buildNodesAndEdgesFromDSLComponents (#940 )

2024-05-27 19:35:14 +08:00

package-lock.json

Fix: The button styles in the PaddleOCR dialog are not applying correctly. (#14350 )

2026-04-24 20:17:01 +08:00

package.json

Fix: The button styles in the PaddleOCR dialog are not applying correctly. (#14350 )

2026-04-24 20:17:01 +08:00

postcss.config.cjs

Refactor: UmiJs -> Vite (#12410 )

2026-01-04 19:14:20 +08:00

README.md

Update Admin UI user guide docs (#11183 )

2025-11-11 20:29:20 +08:00

skills-lock.json

Fix: The dataset on the search page is not displaying the required field error message. (#14041 )

2026-04-10 18:20:50 +08:00

tailwind.config.js

Feat: Optimize the style of the chat page. (#13429 )

2026-03-06 11:42:25 +08:00

tailwind.css

refactor(ui): adjust global navigation bar style (#13419 )

2026-03-05 20:47:29 +08:00

tsconfig.json

Fix: Login page type error. (#14156 )

2026-04-16 18:46:52 +08:00

tsconfig.node.json

Refactor: UmiJs -> Vite (#12410 )

2026-01-04 19:14:20 +08:00

vite.config.ts

Go CLI: fix register user (#14665 )

2026-05-08 15:53:06 +08:00

README.md

Install front-end dependencies

npm install

Launch front-end

npm run dev

The following output confirms a successful launch of the system:

Open your browser and navigate to:

http://localhost:9222 or http://[YOUR_MACHINE_IP]:9222

Replace [YOUR_MACHINE_IP] with your actual machine IP address (e.g., http://192.168.1.49:9222).

Open your browser and navigate to:

http://localhost:9222/admin or http://[YOUR_MACHINE_IP]:9222/admin

Replace [YOUR_MACHINE_IP] with your actual machine IP address (e.g., http://192.168.1.49:9222/admin).

Shutdown front-end

Ctrl + C or

kill -f "umi dev"

README.md

Install front-end dependencies

Launch front-end

Login to RAGFlow web UI

Login to RAGFlow web admin UI

Shutdown front-end