You can use LightStep’s Service diagram to get an aggregate view of trace data as a request travels through your system. The Service diagram provides a visual, interactive, and hierarchical representation of a system’s behavior for a given point in time, based on the query shown in Explorer. It also provides a clear visualization of inter-service relationships and insight into the performance of distributed software. You can see services both upstream and downstream from the queried service and pinpoint services that contribute to the latency of the request. The Service diagram also allows you to easily visualize a complex system architecture, identify services with errors, and quickly formulate or eliminate hypotheses.
The diagram edges represent relationships between spans and is dependent on the quality of your instrumentation. Missing spans may cause edges to appear when there is no parent-child relationship.
The following can cause missing spans:
* The Satellite dropped the span (for example, your Satellite pool is not auto-scaling to keep up with traffic)
* The tracer dropped the span (for example, your application crashed or never called
* There is an issue with your instrumentation (context was dropped, or the service with a parent span is not instrumented).
When you see missing spans, check the Reporting Status page to find the culprit.
The queried service is shown with an animated border. Services that contribute to latency are shown with a yellow halo - the larger the halo, the larger the latency. Services with errors are shown with a red halo.
Latency halos only display when your query contains a service.
The left panel shows operations on the currently selected service with exemplar spans. You can click a span to see it in context in the Trace view.
View the Service Diagram
You access the Service diagram by clicking the Service Diagram tab in Explorer. The service you queried for is centered in the diagram and shows an animated blue circle.
The blue animated border indicates that this is the queried service. Descendant services display a yellow halo to indicate their contribution to the latency experienced by the queried service.
The yellow halo indicates that the service contributes to the latency experienced by the queried service. The size of the halo indicates the size of the latency contribution. The yellow halo only appears when a service is explicitly added as a query term (if your query doesn't include a service, latency halos aren't displayed).
The blue dot in the center of a node indicates the currently selected service. See the side panel for this service’s reporting status and a sample list of operations. You select a service by clicking the center of the node. Selecting a service changes the information in the side panel to reflect that service.
The red halo indicates that the service is experiencing errors.
Nodes appear greyed-out when traces from this service don’t match a filter in the Latency Histogram or Trace Analysis table.
Hover over a service to view its status.
Click the center of a service to select it and see its information in the side panel.
The Service Diagram helps you quickly find the ancestor and descendant services of a given service and whether they contribute to latency. In the graphic above, a user has queried for
api-server and the diagram clearly distinguishes between the
api-proxy that is sending traffic to
api-server and the four services that are receiving traffic from the
database service shows significant latency and may need further investigation.
Change the Service Diagram’s Display
You can move the diagram, zoom in and out, and you can center in on a service using the controls at the top. Click Focus on service to see just the immediate upstream and downstream services.
Move the diagram by clicking and dragging.
Selecting and centering on a different service populates the panel with information about that service (but does not run a query).
Similar to how you can filter the spans from the Trace Analysis table, you can filter spans from the side panel of the Service diagram.
When you click a filter icon on a service or operation, the Trace Analysis table repopulates to show spans from the results that match the filter. Results are taken from all spans that participate in the same traces as the original query.