Use external observability tools

This topic tells you how to generate metrics for your Tanzu Application Platform (commonly known as TAP) applications to enable observability with external tools. It explains how to set up Prometheus on your cluster or integrate an existing Datadog installation.

About Prometheus metrics

Prometheus is an open-source monitoring tool that defines a simple text-based metrics format with client libraries for a wide range of programming languages and frameworks.

By integrating these client libraries into your applications, you can effortlessly generate metrics in the Prometheus format. These metrics are accessible through an HTTP endpoint, enabling Prometheus and other observability tools to conveniently consume (scrape) them.

For Kubernetes, there is an established convention that facilitates the automatic discovery of pods exposing Prometheus metrics. This involves incorporating specific annotations on the pods exposing information for path and port of the metrics endpoint.

Prometheus and other observability tools like Datadog can discover annotated pods and collect the metrics from the endpoint.

Use Prometheus as your observability tool

There are multiple ways to install Prometheus on your cluster:

Use the Prometheus Operator
Use the kube-prometheus-stack Helm chart
Use the Prometheus Helm chart

Prometheus Operator : The Prometheus Operator offers a simplified way to use custom resources to deploy and configure Prometheus, Alertmanager, and related monitoring components.

To install using the Prometheus Operator:

Install the Prometheus Operator bundle by running:

LATEST=$(curl -s https://api.github.com/repos/prometheus-operator/prometheus-operator/releases/latest | jq -cr .tag_name)
curl -sL https://github.com/prometheus-operator/prometheus-operator/releases/download/${LATEST}/bundle.yaml | kubectl create -f -

For RBAC-based environments, create the RBAC rules for the Prometheus service account by running:

cat <<EOF | kubectl apply -f -
apiVersion: v1
kind: ServiceAccount
metadata:
  name: prometheus
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
  name: prometheus
rules:
- apiGroups: [""]
  resources:
  - nodes
  - nodes/metrics
  - services
  - endpoints
  - pods
  verbs: ["get", "list", "watch"]
- apiGroups: [""]
  resources:
  - configmaps
  verbs: ["get"]
- apiGroups:
  - networking.k8s.io
  resources:
  - ingresses
  verbs: ["get", "list", "watch"]
- nonResourceURLs: ["/metrics"]
  verbs: ["get"]
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRoleBinding
metadata:
  name: prometheus
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: ClusterRole
  name: prometheus
subjects:
- kind: ServiceAccount
  name: prometheus
  namespace: default
EOF

Configure a Prometheus scraping job. This job must monitor all pods that are marked with the designated Prometheus annotations.

Note The Prometheus Operator does not support annotation-based discovery of services by default. To enable these annotations, you must set up a custom scrape job configuration.

{{> 'partials/integrations/create-config-secret' }}
1. Create a Prometheus resource that uses this secret by running:
```
cat <<EOF | kubectl apply -f -
apiVersion: monitoring.coreos.com/v1
kind: Prometheus
metadata:
  name: prometheus
spec:
  serviceAccountName: prometheus
  podMonitorSelector: {}
  additionalScrapeConfigs:
    name: additional-scrape-configs
    key: prometheus-scrape-config.yaml
  resources:
    requests:
      memory: 400Mi
  enableAdminAPI: false
EOF
```
  The Prometheus Operator automatically detects the configuration and generates a scrape job from it. This job is executed regularly by the Prometheus instance that is set up.
To access the Prometheus web interface, you must make port 9090 of the Prometheus server pod accessible outside the cluster through a Kubernetes service or ingress. For development purposes, you can forward the port to your local machine using kubectl.

To see the scrape configuration that the Prometheus instance has picked up, run:

kubectl get secret prometheus-prometheus -ojson | jq -r '.data["prometheus.yaml.gz"]' | base64 -d | gunzip

For more information about the Prometheus Operator, including how to persist your metrics or activate alerting features, see the Prometheus Operator documentation.

kube-prometheus-stack Helm chart : The Prometheus community has developed the kube-prometheus-stack Helm chart, which establishes a comprehensive cluster monitoring stack. The Helm chart sets up various scraping jobs for metrics related to the Kubernetes cluster.

These are some of its features:

Includes the Prometheus Operator
Includes cluster monitoring
Ensures high availability
Integrates Node Exporter Grafana dashboards

For more information, see the kube-prometheus-stack Helm chart README in GitHub.

To install the kube-prometheus-stack Helm chart:

Add the Helm repository by running:

helm repo add prometheus-community https://prometheus-community.github.io/helm-charts
helm repo update

Because there isn't a predefined scrape job configuration to support annotation-based discovery of services, you must create the configuration secret additional-scrape-configs.

{{> 'partials/integrations/create-config-secret' }}

Install the kube-prometheus-stack Helm chart by running:

helm upgrade --install kube-prometheus-stack prometheus-community/kube-prometheus-stack \
  --set prometheus.prometheusSpec.additionalScrapeConfigsSecret.name=additional-scrape-configs \
  --set prometheus.prometheusSpec.additionalScrapeConfigsSecret.key=prometheus-scrape-config.yaml \
  --set prometheus.prometheusSpec.additionalScrapeConfigsSecret.enabled=true

To see the scrape configuration that the Prometheus instance has picked up, run:

kubectl get secret prometheus-kube-prometheus-stack-prometheus -ojson | jq -r '.data["prometheus.yaml.gz"]' | base64 -d | gunzip

Prometheus Helm chart : The Prometheus community provides the Prometheus Helm chart to install Prometheus on your Kubernetes cluster. The Helm chart installs scraping job configurations for pods and services tagged with Prometheus scraping annotations and sets up scraping for metrics related to the Kubernetes cluster. It includes key components such as Alert Manager, kube-state-metrics, Node Exporter, and Push Gateway.

For more information, see the Prometheus Helm chart README in GitHub.

To install the Prometheus Helm chart:

Add the Helm repository by running:

helm repo add prometheus-community https://prometheus-community.github.io/helm-charts
helm repo update

Install the Prometheus Helm chart by running:

helm upgrade --install prometheus prometheus-community/prometheus

Use Datadog as your observability tool

If you use Datadog, you can use it to scrape the Prometheus endpoints without having to install Prometheus itself. Datadog automatically gathers Prometheus metrics from pods that are annotated with the default Prometheus annotations. The Datadog Agent forwards the metrics to the Datadog servers.

To use Datadog as your observability tool:

Add the Helm repository by running:

helm repo add datadog https://helm.datadoghq.com
helm repo update

Install the Datadog Agent Helm chart by running:
```
helm upgrade --install datadog-operator datadog/datadog-operator
```
Note There is a known issue with Datadog Cluster Agent on Azure Kubernetes Service (AKS) clusters. For more information, see the troubleshooting item Datadog agent cannot reconcile webhook on AKS.
Generate a new API key in Datadog for the Agent that will push metrics to Datadog. You do this in the Datadog UI, under Profile/Organization Settings/API Keys.
Create a secret for the Datadog API key by running:
```
kubectl create secret generic datadog-secret --from-literal api-key=API-KEY
```
Where API-KEY is the API you generated in the previous step.

Install the Datadog Agent by running:

cat <<EOF | kubectl apply -f -
apiVersion: datadoghq.com/v2alpha1
kind: DatadogAgent
metadata:
  name: datadog
spec:
  global:
    clusterName: YOUR-CLUSTER-NAME
    site: DATADOG-HOST-NAME
    credentials:
      apiSecret:
        secretName: datadog-secret
        keyName: api-key
  features:
    prometheusScrape:
      enabled: true
      enableServiceEndpoints: true
EOF

Where:

YOUR-CLUSTER-NAME is the name of your cluster as you want to see it in Datadog.
DATADOG-HOST-NAME is your Datadog host name, for example, datadoghq.eu.

Enable metric collection on Spring Boot workloads

To enable Spring Boot workloads to create Prometheus metrics, if using Maven, add the following dependencies to your pom.xml:

<dependencies>
    <dependency>
      <groupId>org.springframework.boot</groupId>
      <artifactId>spring-boot-starter-actuator</artifactId>
    </dependency>
    <dependency>
      <groupId>io.micrometer</groupId>
      <artifactId>micrometer-registry-prometheus</artifactId>
      <scope>runtime</scope>
    </dependency>
</dependencies>

This creates default metrics for the JVM, HTTP traffic, and more. For a list of supported metrics, see the Spring Boot documentation.

When deploying Spring Boot workloads on Tanzu Application Platform, the Spring Boot conventions ensure that actuator endpoints are exposed. The Prometheus metrics endpoint is also made accessible.

For Prometheus to find this endpoint on a pod, you must include specific annotations in your workload.yaml file. These annotations must align with your configuration. After adding these annotations, deploy the changes:

apiVersion: carto.run/v1alpha1
kind: Workload
metadata:
  name: spring-petclinic
  annotations:
    prometheus.io/scrape: 'true'
    prometheus.io/path: '/actuator/prometheus'
    prometheus.io/port: '8081'
    # ...

For more information about the Spring Boot Prometheus integration, see the Spring Boot Reference Documentation.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!