databricks-opentelemetry's People
databricks-opentelemetry's Issues
Expose Structured Streaming backlog metrics
Streaming sources in Spark emit backlog metrics that indicate how behind a stream is. These metrics are helpful in determining a stream's performance relative to its source.
Streaming sources such as Autoloader, Kinesis, Kafka, and Delta emit these metrics.
This issue is to track efforts related to exposing these as metrics that can be centralized using OpenTelemetry.
Grafana automated install
Capture cluster event log events
This issue is to track any efforts related to emitting metrics or logs that relate to a cluster's event log.
Tracing examples and configs when model serving / calling LLM
When serving models / LLMs, latency is often very important but can be difficult to attribute, especially in complicated systems with multiple steps and systems.
Logs can be super helpful for troubleshooting, but collating logs across multiple systems isn't very easy and logs aren't easily consumed by humans. Tracing is often seen as a better alternative as it gives gantt charts out of the box, even with multiple systems with tools like Jaeger.
This issue is to track showing how to implement tracing for Databricks ML serving applications.
Jaeger automated install
Expose advanced scan metrics
It's helpful to detect things like cloud storage throttling, request count, etc. We should find a way to expose these metrics to OpenTelemetry.
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. ๐๐๐
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google โค๏ธ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.