Conversation
```diff
     return json.loads(content)
 except json.JSONDecodeError as e:
-    logger.warning(f"Failed to parse as JSON despite content-type: {e}")
+    logger.error(f"Failed to parse as JSON despite content-type: {e}")
```
⚠️ Quality: Blanket warning→error promotion misclassifies non-critical failures
This PR promotes all logger.warning() calls to logger.error() without distinguishing between genuinely erroneous conditions and expected/graceful degradation scenarios. In many logging frameworks and monitoring setups, ERROR level triggers alerts and pages on-call engineers, while WARNING is for recoverable or expected-but-unusual situations.
Several of these changes are semantically incorrect:
- `parse_openapi_schema` (parser.py:57,62) — When JSON parsing fails here, the code falls through to a generic JSON→YAML fallback (lines 64-80). This is normal control flow, not an error. Logging at ERROR on a successful fallback path will generate false-alarm noise.
- `test_server_connection` (connection.py:146) — This is a test function designed to return `True`/`False`. A failed connection is an expected outcome, not an error. (Also note the inconsistency: `McpProtocolError` is now `error` while the generic `Exception` on line 149 stays `warning`.)
- `_test_connection_steps_automation_workflow` (test_connections.py:138) — Test connection steps are expected to fail sometimes; the result is reported back to the user via `TestConnectionStepResult`. Logging each failure as ERROR is misleading.
- `find_database_service_by_hostname`/`find_messaging_service_by_brokers` (kafkaconnect/metadata.py:234,296) — These return `None` on failure and callers handle the absence gracefully. A missing service match is a warning, not an error.
- `get_aws_bucket_region` (s3/metadata.py:760) — Falls back to the configured region on line 761. This is graceful degradation.
Recommendation: Review each call site individually. Reserve logger.error() for conditions that indicate something is broken and needs attention (e.g., data corruption, unrecoverable API failures). Keep logger.warning() for expected-but-unusual situations where the code continues gracefully.
Suggested fix:
Audit each site individually. A helpful heuristic:
- ERROR: The operation failed AND the caller cannot recover (e.g., data loss, broken pipeline)
- WARNING: The operation failed BUT the code degrades gracefully (e.g., returns `None`, uses a fallback)
- DEBUG: Expected control-flow branches
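The heuristic above can be sketched as a pair of call sites. This is illustrative only — the function names, the fallback constant, and the injected callables are hypothetical, not taken from the PR:

```python
import logging

logger = logging.getLogger("ingestion")

DEFAULT_REGION = "us-east-1"  # hypothetical configured fallback


def get_bucket_region(bucket_name: str, lookup) -> str:
    """Graceful degradation: a fallback exists, so WARNING fits."""
    try:
        return lookup(bucket_name)
    except Exception:
        logger.warning(
            "Unable to get the region for bucket %s; falling back to %s",
            bucket_name,
            DEFAULT_REGION,
            exc_info=True,  # keep the traceback without raising the level
        )
        return DEFAULT_REGION


def publish_usage(payload: dict, send) -> None:
    """No fallback: the data is lost unless someone acts, so ERROR fits."""
    try:
        send(payload)
    except Exception:
        logger.error("Failed to publish usage payload", exc_info=True)
        raise  # the caller must see the failure
```

The deciding question at each site is whether execution continues usefully after the `except` block, not how surprising the exception is.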
Pull request overview
Updates ingestion and data-quality modules to emit ERROR logs (instead of WARNING) for a variety of caught exceptions, aiming to improve error visibility in logs.
Changes:
- Switched many exception-path log statements from `logger.warning(...)` to `logger.error(...)` across ingestion sources and utilities.
- Switched multiple data-quality validation abort logs from warning to error.
Reviewed changes
Copilot reviewed 56 out of 56 changed files in this pull request and generated 7 comments.
| File | Description |
|---|---|
| ingestion/src/metadata/pii/scanners/ner_scanner.py | Promotes processing-exception logs to ERROR in the NER PII scanner. |
| ingestion/src/metadata/ingestion/source/storage/s3/metadata.py | Promotes S3 bucket region lookup failure logging to ERROR. |
| ingestion/src/metadata/ingestion/source/search/opensearch/parser.py | Promotes OpenSearch mapping parse failure logging to ERROR. |
| ingestion/src/metadata/ingestion/source/search/elasticsearch/parser.py | Promotes Elasticsearch mapping parse failure logging to ERROR. |
| ingestion/src/metadata/ingestion/source/pipeline/spline/utils.py | Promotes Spline parsing failure logging to ERROR. |
| ingestion/src/metadata/ingestion/source/pipeline/spline/metadata.py | Promotes datasource parsing failure logging to ERROR. |
| ingestion/src/metadata/ingestion/source/pipeline/kafkaconnect/metadata.py | Promotes multiple KafkaConnect ingestion error logs to ERROR. |
| ingestion/src/metadata/ingestion/source/pipeline/kafkaconnect/client.py | Promotes KafkaConnect client exception logs to ERROR. |
| ingestion/src/metadata/ingestion/source/pipeline/gluepipeline/metadata.py | Promotes GluePipeline S3 script download failure logging to ERROR. |
| ingestion/src/metadata/ingestion/source/pipeline/domopipeline/metadata.py | Promotes DomoPipeline source URL retrieval failure logging to ERROR. |
| ingestion/src/metadata/ingestion/source/pipeline/dbtcloud/client.py | Promotes dbt Cloud client exception logs to ERROR. |
| ingestion/src/metadata/ingestion/source/pipeline/airflow/api/client.py | Promotes Airflow API JSON parse failure logging to ERROR. |
| ingestion/src/metadata/ingestion/source/pipeline/airbyte/metadata.py | Promotes Airbyte timestamp parsing failure logging to ERROR. |
| ingestion/src/metadata/ingestion/source/messaging/pubsub/metadata.py | Promotes PubSub metadata/schema parsing failure logs to ERROR. |
| ingestion/src/metadata/ingestion/source/messaging/common_broker_source.py | Promotes schema-with-references parsing failure logging to ERROR. |
| ingestion/src/metadata/ingestion/source/mcp/connection.py | Promotes MCP connection failure logging to ERROR. |
| ingestion/src/metadata/ingestion/source/mcp/client.py | Promotes MCP client parsing/sending failure logs to ERROR. |
| ingestion/src/metadata/ingestion/source/drive/sftp/metadata.py | Promotes SFTP sample/schema extraction failure logging to ERROR. |
| ingestion/src/metadata/ingestion/source/database/lineage_source.py | Promotes lineage queue processing failure logs to ERROR. |
| ingestion/src/metadata/ingestion/source/database/external_table_lineage_mixin.py | Promotes external table lineage yield failure logging to ERROR. |
| ingestion/src/metadata/ingestion/source/database/common_db_source.py | Promotes SQLAlchemy engine dispose failure logging to ERROR. |
| ingestion/src/metadata/ingestion/source/dashboard/superset/db_source.py | Promotes Superset chart list fetch failure logging to ERROR. |
| ingestion/src/metadata/ingestion/source/dashboard/qliksense/client.py | Promotes QlikSense client fetch failure logs to ERROR. |
| ingestion/src/metadata/ingestion/source/dashboard/qlikcloud/client.py | Promotes QlikCloud client fetch failure logs to ERROR. |
| ingestion/src/metadata/ingestion/source/dashboard/microstrategy/client.py | Promotes MicroStrategy client fetch failure logs to ERROR. |
| ingestion/src/metadata/ingestion/source/dashboard/metabase/client.py | Promotes Metabase client fetch failure logs to ERROR. |
| ingestion/src/metadata/ingestion/source/api/rest/parser.py | Promotes OpenAPI schema parse failure logs to ERROR. |
| ingestion/src/metadata/ingestion/ometa/mixins/container_mixin.py | Promotes container sample parsing failure logging to ERROR. |
| ingestion/src/metadata/ingestion/connections/test_connections.py | Promotes automation workflow step failure logging to ERROR. |
| ingestion/src/metadata/ingestion/bulksink/metadata_usage.py | Promotes metadata usage publish failure logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/table/sqlalchemy/tableDiff.py | Promotes Data Diff unsupported dialect logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/table/base/tableRowInsertedCountToBeBetween.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/table/base/tableRowCountToEqual.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/table/base/tableRowCountToBeBetween.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/table/base/tableCustomSQLQuery.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/table/base/tableColumnToMatchSet.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/table/base/tableColumnNameToExist.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/table/base/tableColumnCountToEqual.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/table/base/tableColumnCountToBeBetween.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/column/base/columnValuesToNotMatchRegex.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/column/base/columnValuesToMatchRegex.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/column/base/columnValuesToBeUnique.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/column/base/columnValuesToBeNotNull.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/column/base/columnValuesToBeNotInSet.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/column/base/columnValuesToBeInSet.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/column/base/columnValuesToBeBetween.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/column/base/columnValuesToBeAtExpectedLocation.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/column/base/columnValuesSumToBeBetween.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/column/base/columnValuesMissingCount.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/column/base/columnValueStdDevToBeBetween.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/column/base/columnValueMinToBeBetween.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/column/base/columnValueMedianToBeBetween.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/column/base/columnValueMeanToBeBetween.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/column/base/columnValueMaxToBeBetween.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/data_quality/validations/column/base/columnValueLengthsToBeBetween.py | Promotes validation abort logging to ERROR. |
| ingestion/src/metadata/clients/domo_client.py | Promotes Domo client exception logs to ERROR. |
```diff
 except Exception as exc:
     logger.debug(traceback.format_exc())
-    logger.warning(f"Failed to close the api sesison due to [{exc}]")
+    logger.error(f"Failed to close the api sesison due to [{exc}]")
```
Typo in log message: "sesison" should be "session".
```diff
 except Exception as exc:
     logger.debug(traceback.format_exc())
-    logger.warning(f"Unable to get run info :{exc}")
+    logger.error(f"Unable to get run info :{exc}")
```
Minor formatting issue in the error message: there is an extra space before the colon ("info :{exc}"). Consider changing it to a consistent "info: {exc}" format for readability/searchability.
```diff
 except Exception as exc:
     logger.debug(traceback.format_exc())
-    logger.warning(f"Unable to get connector plugins  {exc}")
+    logger.error(f"Unable to get connector plugins  {exc}")
```
There are two consecutive spaces before the exception in this message ("plugins {exc}"). Tightening the formatting helps keep logs consistent and easier to grep.
```python
logger.error(f"Unknown error while processing {row} - {exc}")
logger.debug(traceback.format_exc())
```
This error log includes the raw sample value (row), which can contain PII. Since this is the PII scanner, emitting sample data at ERROR level risks leaking sensitive data into centralized logs. Consider redacting the value (e.g., log row index/length/hash) or logging a generic message and rely on exception/traceback for debugging.
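One way to act on this suggestion is to log a fingerprint of the sample instead of the value itself. A minimal sketch — the `redact` helper and `safe_process` wrapper are hypothetical, not part of the scanner:

```python
import hashlib
import logging
import traceback

logger = logging.getLogger("ner_scanner")


def redact(value: str) -> str:
    """Describe a sample value without exposing it: length plus a short digest."""
    digest = hashlib.sha256(value.encode("utf-8")).hexdigest()[:8]
    return f"<redacted len={len(value)} sha256={digest}>"


def safe_process(row: str, process) -> None:
    try:
        process(row)
    except Exception as exc:
        # The fingerprint is enough to correlate failures across log lines
        # without writing the raw sample into centralized logs.
        logger.error(f"Unknown error while processing {redact(row)} - {exc}")
        logger.debug(traceback.format_exc())
```

The digest lets operators tell whether the same value keeps failing, while the raw PII stays out of the log pipeline entirely.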
```diff
 except Exception:
     logger.debug(traceback.format_exc())
-    logger.warning(f"Unable to get the region for bucket: {bucket_name}")
+    logger.error(f"Unable to get the region for bucket: {bucket_name}")
 return region or self.service_connection.awsConfig.awsRegion
```
This `except Exception:` branch logs at ERROR level but drops the exception details (no `as exc`) and only logs the bucket name. To make the error actionable, capture the exception and include it (or use `logger.exception(...)` / `logger.error(..., exc_info=True)`).
```diff
 logger.debug(traceback.format_exc())
-logger.warning(f"Unable to parse the index properties: {exc}")
+logger.error(f"Unable to parse the index properties: {exc}")
```
The traceback is logged at DEBUG while the message is now ERROR. If ERROR logs are what get collected/alerted on, the stack trace will be missing. Consider logging the exception with traceback at the same level (e.g., `logger.exception(...)` or `logger.error(..., exc_info=True)`) and avoid the separate `traceback.format_exc()` call.
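The consolidation this comment suggests looks roughly like the sketch below. Names are illustrative; it relies on stdlib `logging`, where `Logger.exception` logs at ERROR level with the active traceback attached:

```python
import logging

logger = logging.getLogger("parser")


def parse_index_properties(raw: str, parse):
    """Return parsed properties, or None when parsing fails."""
    try:
        return parse(raw)
    except Exception:
        # One call emits the message AND the traceback at ERROR level,
        # so the stack trace survives even when DEBUG is not collected.
        logger.exception("Unable to parse the index properties")
        return None
```

This replaces two log calls (one ERROR, one DEBUG) with a single ERROR record that carries the full exception context.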
```diff
 except Exception as err:
     logger.debug(traceback.format_exc())
-    logger.warning(f"Failed to fetch chart list due to - {err}]")
+    logger.error(f"Failed to fetch chart list due to - {err}]")
```
The log message ends with an extra closing bracket (]), which makes the error output look malformed. Remove the trailing bracket or ensure brackets are paired consistently.
🟡 Playwright Results — all passed (19 flaky)

✅ 3955 passed · ❌ 0 failed · 🟡 19 flaky · ⏭️ 86 skipped

🟡 19 flaky test(s) (passed on retry)

How to debug locally:

```shell
# Download playwright-test-results-<shard> artifact and unzip
npx playwright show-trace path/to/trace.zip  # view trace
```
The Python checkstyle failed. Please run `…`. You can install the pre-commit hooks with `…`.
Code Review
Pull request overview
Copilot reviewed 57 out of 57 changed files in this pull request and generated 7 comments.
Comments suppressed due to low confidence (1)
ingestion/src/metadata/ingestion/source/mcp/connection.py:140
- This method now logs `McpProtocolError` as ERROR, but the generic `except Exception` branch still logs as WARNING. Given the PR's intent ("move warning logs to error logs" for caught exceptions), consider making the unexpected-exception branch `logger.error`/`logger.exception` as well so unexpected connection failures are consistently surfaced as errors (and include the traceback if possible).
```diff
 logger.debug(traceback.format_exc())
-logger.warning(f"Failed to close the api sesison due to [{exc}]")
+logger.error(f"Failed to close the api sesison due to [{exc}]")
```
Typo in log message: `sesison` should be `session`.
```diff
 try:
     self.engine.dispose()
 except Exception as exc:  # pylint: disable=broad-except
-    logger.warning(f"Failed to dispose engine: {exc}")
+    logger.error(f"Failed to dispose engine: {exc}")
 self.engine = None
```
`_release_engine` now logs disposal failures at ERROR but drops the traceback (`logger.error(f"... {exc}")`). For troubleshooting connection/pool cleanup issues, include exception context via `logger.exception(...)` or `logger.error(..., exc_info=True)` so the stack trace is available at error level.
```diff
 try:
     self.process_data(row=row, entities_score=entities_score)
 except Exception as exc:
-    logger.warning(f"Unknown error while processing {row} - {exc}")
+    logger.error(f"Unknown error while processing {row} - {exc}")
     logger.debug(traceback.format_exc())
```
The exception handling logs an ERROR but the stack trace is still only emitted at DEBUG via `traceback.format_exc()`, which likely defeats the purpose of "capture errors" in typical INFO/ERROR deployments. Prefer `logger.exception(...)` (or `logger.error(..., exc_info=True)`) here so the traceback is attached to the error-level log, and you can drop the separate debug traceback log to avoid duplication.
```diff
 except Exception as exc:
-    logger.warning(f"Failed to parse jdbc url: {exc}")
+    logger.error(f"Failed to parse jdbc url: {exc}")
     logger.error(traceback.format_exc())
```
Same as above for `parse_jdbc_url`: emitting `traceback.format_exc()` as a separate ERROR line is noisy and drops structured exception info. Prefer `logger.exception(...)` (or `logger.error(..., exc_info=True)`) and remove the extra traceback log line.
```diff
 with patch("metadata.ingestion.source.mcp.client.logger") as mock_logger:
     transport.send_notification("notifications/initialized", {})
-    mock_logger.warning.assert_called_once()
-    assert "server down" in str(mock_logger.warning.call_args)
+    mock_logger.error.assert_called_once()
+    assert "server down" in str(mock_logger.error.call_args)
```
The test docstring/comment still says `send_notification` should log warnings, but the assertion now expects `logger.error`. Update the docstring/comment to match the new behavior so the test remains self-descriptive.
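A self-contained version of that test pattern, with the sender and module path simplified for illustration (the real test patches `metadata.ingestion.source.mcp.client.logger`; the `send_notification` stand-in below is hypothetical):

```python
import logging
from unittest.mock import patch

logger = logging.getLogger("mcp_client")


def send_notification(method: str, transport) -> None:
    """Log send failures at ERROR, mirroring the new behavior under test."""
    try:
        transport(method)
    except Exception as exc:
        logger.error(f"Failed to send {method}: {exc}")


def test_send_failure_logs_error():
    def down(_method):
        raise ConnectionError("server down")

    # Patch the module-level name so the function under test sees the mock.
    with patch(f"{__name__}.logger") as mock_logger:
        send_notification("notifications/initialized", down)

    mock_logger.error.assert_called_once()
    assert "server down" in str(mock_logger.error.call_args)
```

Patching where the logger is looked up (the module under test, not the `logging` package) is what makes the level-specific assertion reliable.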
```diff
 def parse_dbfs_path(path: str) -> Optional[str]:
     try:
         return path.split("/")[-1]
     except Exception as exc:
-        logger.warning(f"Failed to parse dbfs: {exc}")
+        logger.error(f"Failed to parse dbfs: {exc}")
         logger.error(traceback.format_exc())
         return None
```
`parse_dbfs_path` logs the traceback by stringifying `traceback.format_exc()` as a second ERROR line. This loses structured exception context and produces duplicate log lines. Prefer `logger.exception(...)` (or `logger.error(..., exc_info=True)`) so the traceback is included with the primary message, and remove the separate `logger.error(traceback.format_exc())`.
```diff
 except Exception:
     logger.debug(traceback.format_exc())
-    logger.warning(f"Unable to get the region for bucket: {bucket_name}")
+    logger.error(f"Unable to get the region for bucket: {bucket_name}")
 return region or self.service_connection.awsConfig.awsRegion
```
In the exception path, the traceback is only logged at DEBUG, while the ERROR log omits exception context. If the goal is to improve observability of failures, use `logger.exception(...)` (or `logger.error(..., exc_info=True)`) so the traceback is captured at error level and you don't rely on DEBUG logs being enabled.
Resolved conflicts in two files where main's PR #27728 swapped `logger.warning` → `logger.error` while Stage 2c added noqa markers:

- ingestion/src/metadata/clients/domo_client.py:168 (TRY201 noqa preserved)
- ingestion/src/metadata/ingestion/source/pipeline/spline/utils.py, two log blocks (TRY400 noqa preserved on `traceback.format_exc` lines)

Re-ran `ruff check --add-noqa` on the merged files; main's auto-merged warning→error swap introduced 94 new TRY400 violations across data-quality validators, which are now grandfathered in line.

Regenerated the basedpyright baseline in a Linux Docker container so column positions match CI (Ubuntu) exactly. Local macOS arm64 will see column-shifted entries due to stub differences between macOS and Linux wheels; `--baselinemode=discard` mode tolerates the drift downstream. Pinned `pythonPlatform = "Linux"` in pyproject.toml so CI analysis is platform-stable across runners.
Per gitar-bot's review on PR #27774:

1. Main's PR #27728 promoted ~60 `logger.warning()` → `logger.error()` calls inside `except` blocks. Those changes landed on main with their own baseline updates. Our PR doesn't promote anything — the merge from origin/main brought those `error` calls along with their baseline entries. The bot interpreted the `# noqa: TRY400` we added next to those lines as us silencing the rule case-by-case. Cleaner: globally ignore TRY400 in pyproject.toml, with a comment explaining why the codebase's `logger.error(...)` + separate `logger.debug(traceback.format_exc())` pattern is intentional. Strip ~430 per-line `# noqa: TRY400` markers from source.
2. Document that `S101` in `per-file-ignores` is a forward-looking entry — flake8-bandit (`S`) is not yet selected, so the rule is a no-op today; the entry stays so that when `S` lands later, tests don't immediately error.

Reverts the platform pin and the Linux Docker–generated baseline. Keep main's baseline intact and let CI surface the exact column-shifted entries; the team will decide whether to fix in place (revert format on affected files) or add per-line `# pyright: ignore` markers.
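The global ignore described in point 1 would look roughly like this in pyproject.toml. This is a sketch of the Ruff configuration, assuming a `[tool.ruff.lint]` section already exists:

```toml
[tool.ruff.lint]
# TRY400 prefers `logging.exception` over `logger.error` inside `except`
# blocks. This codebase intentionally pairs `logger.error(...)` with a
# separate `logger.debug(traceback.format_exc())`, so the rule is ignored
# globally instead of via ~430 per-line `# noqa: TRY400` markers.
ignore = ["TRY400"]
```

A single documented ignore keeps the policy in one place, whereas scattered `# noqa` markers invite exactly the per-site scrutiny the bot applied.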
Describe your changes:
Fixes logging to capture errors
Type of change:
Checklist:
Fixes <issue-number>: <short explanation>

Summary by Gitar

- `client.py` to correctly log response content types as warnings instead of errors.
- `test_mcp_client.py` assertions to reflect the change from warning to error logging for server connection failures.