⚠ This page is served via a proxy. Original site: https://github.com
This service does not collect credentials or authentication data.
Skip to content

Conversation

@natoverse
Copy link
Collaborator

Introduce DataReader that wraps TableProvider and applies type coercion functions when loading dataframes from weakly-typed formats (e.g. CSV).

  • Add DataReader class with methods for each table type: entities, relationships, communities, community_reports, covariates, text_units, and documents
  • Add typed loading functions in dfs.py for community_reports, covariates, text_units, and documents (entities, relationships, communities already existed)
  • Integrate DataReader into all 17 indexing workflows replacing raw read_dataframe calls
  • Integrate DataReader into CLI query's _resolve_output_files for typed loading across all search types (global, local, drift, basic)
  • Export DataReader from data_model package init

Introduce DataReader that wraps TableProvider and applies type coercion
functions when loading dataframes from weakly-typed formats (e.g. CSV).

- Add DataReader class with methods for each table type: entities,
  relationships, communities, community_reports, covariates, text_units,
  and documents
- Add typed loading functions in dfs.py for community_reports, covariates,
  text_units, and documents (entities, relationships, communities already
  existed)
- Integrate DataReader into all 17 indexing workflows replacing raw
  read_dataframe calls
- Integrate DataReader into CLI query's _resolve_output_files for typed
  loading across all search types (global, local, drift, basic)
- Export DataReader from data_model package __init__
@natoverse natoverse requested a review from a team as a code owner February 10, 2026 01:15
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant