observability is:
- collection of data from production systems
- answer questions before you need them
- tools for exploration & visualization
- allows the analyze -> react -> response workflow
Data:
- apm
- logs
- metrics
- traces
they do ci/cd process analysis w/ otel via otel and cd pipelines
They have some issues where tests span multiple teams. So they use o11y to hold accountable & notify the right folks.
Also using otel to make it clearer which tests were pass/failed, etc.
otel-cli is a project which will wrap shell scripts.
pytest-otel - otel hooks for pytest.
jenkins: opentelemetry-plugin
They are building a thing which takes out the junit xml, reads in some relevant git context, and publishes it up to the trace/metrics service.
They have junit2otel cli: https://github.com/mdelapenya/junit2otlp