Add API request IDs, structured errors, pagination and filters by Jagadeeshck · Pull Request #73 · Jagadeeshck/DataObs

Jagadeeshck · 2026-05-21T14:47:41Z

Motivation

Bring API behaviour to production readiness by adding request correlation, consistent structured errors, and safe list handling for large datasets.
Provide a single, predictable error contract and request IDs to help clients and server-side observability/debugging.
Support basic offset pagination and useful query filters on commonly used list endpoints while preserving existing top-level response keys.

Description

Added request ID middleware that accepts an incoming X-Request-ID or generates a UUID, stores it on request.state, and echoes it on every response; request IDs are included in all structured error payloads (src/api/app.py).
Introduced a consistent structured error schema ErrorResponse (error.code, error.message, error.details, request_id) and wired handlers for general HTTPException, StarletteHTTPException, RequestValidationError (400/422) and unhandled exceptions (500) (src/api/app.py).
Implemented pagination models (PaginationMeta) and applied limit/offset query parameters (defaults and caps via Query) to GET /rules, GET /quality/results, GET /lineage/nodes, and GET /lineage/edges, preserving the existing top-level keys (rules, results, nodes, edges) and adding pagination metadata (src/api/app.py).
Added filtering query parameters to the endpoints requested (rules, quality results, lineage nodes/edges) and ensured parameters appear in OpenAPI by using Query parameters (src/api/app.py).
Extended the store contract to accept optional limit/offset and filter parameters, and updated InMemoryStore and ElasticsearchStore to perform filtering/pagination (pushed into ES where practical) while keeping backwards-compatible defaults (src/api/store.py, src/api/es_store.py).
Tests updated/added to cover request ID propagation, generated request IDs, structured error payloads with request_id, pagination and filter behaviour, limit max enforcement and negative offset validation; documentation added at docs/api/rest-api.md with examples and curl snippets.

Testing

Ran byte-compile: python -m compileall -q src tests and it completed successfully (no compile failures).
Ran endpoint/unit tests individually: pytest tests/test_api_endpoints.py -q, pytest tests/test_api_es_store.py -q, pytest tests/test_settings.py -q, pytest tests/test_poc_pipeline.py -q and all targeted test files passed.
Ran full test suite: pytest -q which completed successfully with all tests passing (suite result: tests passed; some integration tests remain skipped by default).

Codex Task

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 22fff0accf

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-05-21T14:51:16Z

+            return
        token = credentials.credentials if credentials and credentials.scheme.lower() == "bearer" else None
-        # Preserve the legacy parser's exact "Bearer " prefix behavior for unusual clients.
        if token is None and authorization and authorization.startswith("Bearer "):
            token = authorization[len("Bearer "):].strip()
        if token != settings.api_token:


Reject unauthenticated requests when strict auth is enabled

When api_token is unset and allow_unauthenticated_dev is False, this logic still allows requests without an Authorization header: token stays None, and token != settings.api_token is False when both are None. That means protected endpoints become publicly accessible under a strict-auth configuration with a missing token, which is a security regression from the previous behavior.

Useful? React with 👍 / 👎.

@copilot fix issues based on recommendation

chatgpt-codex-connector · 2026-05-21T14:51:16Z

        rules = stores.store.get_all_rules()
-        return {"rules": rules, "count": len(rules)}
+        filtered = [r for r in rules if (dataset is None or r.get("dataset") == dataset) and (enabled is None or r.get("enabled") == enabled) and (severity is None or r.get("severity") == severity) and (check_type is None or r.get("check_type", r.get("type")) == check_type)]
+        page = _paginate(filtered, limit, offset)
+        return {"rules": page["items"], "count": len(page["items"]), "pagination": page["pagination"]}


Propagate /rules limit and offset into the store query

This endpoint now calls get_all_rules() without pagination arguments, then paginates the returned subset in memory. Because store defaults are limit=100, offset=0, requests like ?limit=500 or ?offset=120 can never access records beyond the first 100, and pagination.total is computed from that truncated slice rather than the real result set.

Useful? React with 👍 / 👎.

@copilot fix issues based on recommendation

chatgpt-codex-connector · 2026-05-21T14:51:16Z

+    def list_quality_results(self, limit: int = 100, offset: int = 0, table: Optional[str] = None, status: Optional[str] = None, dataset: Optional[str] = None, check_type: Optional[str] = None, severity: Optional[str] = None, run_id: Optional[str] = None) -> List[Dict[str, Any]]:
        results = list(self._quality.values())
        if table:
            results = [r for r in results if r.get("table") == table]
        if status:


Honor check_type, severity, and run_id in memory filtering

The in-memory implementation advertises check_type, severity, and run_id filters in its method signature, but it only applies table, status, and dataset. In memory-backed deployments/tests, those documented query params are silently ignored, producing incorrect results and backend-inconsistent behavior.

Useful? React with 👍 / 👎.

@copilot fix issues based on recommendation

Add API request IDs, structured errors, pagination and filtering

22fff0a

Jagadeeshck added the codex label May 21, 2026 — with ChatGPT Codex Connector

chatgpt-codex-connector Bot reviewed May 21, 2026

View reviewed changes

Copilot started work on behalf of Jagadeeshck May 21, 2026 14:57 View session

Copilot finished work on behalf of Jagadeeshck May 21, 2026 14:58

Copilot started work on behalf of Jagadeeshck May 21, 2026 14:59 View session

Copilot finished work on behalf of Jagadeeshck May 21, 2026 14:59

Copilot finished work on behalf of Jagadeeshck May 21, 2026 15:00

Jagadeeshck merged commit c9e9e96 into main May 21, 2026
15 of 21 checks passed

Jagadeeshck deleted the codex/add-production-api-features-and-pagination branch June 6, 2026 16:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add API request IDs, structured errors, pagination and filters#73

Add API request IDs, structured errors, pagination and filters#73
Jagadeeshck merged 1 commit into
mainfrom
codex/add-production-api-features-and-pagination

Jagadeeshck commented May 21, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot May 21, 2026

Uh oh!

Jagadeeshck May 21, 2026

Uh oh!

chatgpt-codex-connector Bot May 21, 2026

Uh oh!

Jagadeeshck May 21, 2026

Uh oh!

chatgpt-codex-connector Bot May 21, 2026

Uh oh!

Jagadeeshck May 21, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Jagadeeshck commented May 21, 2026

Motivation

Description

Testing

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot May 21, 2026

Choose a reason for hiding this comment

Uh oh!

Jagadeeshck May 21, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 21, 2026

Choose a reason for hiding this comment

Uh oh!

Jagadeeshck May 21, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 21, 2026

Choose a reason for hiding this comment

Uh oh!

Jagadeeshck May 21, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant