Deploy Prometheus Monitoring with Prometheus Operator

This guide outlines the deployment process for a custom Prometheus monitoring setup using the Prometheus Operator.

Prerequisites

Ensure your Kubernetes cluster has:

Deployment Flow Chart

Below is a visual representation of the Prometheus deployment process:

This flow chart illustrates the key steps in deploying Prometheus monitoring using the Prometheus Operator.

Deployment Steps

1. Setup Environment

Clone the repository and navigate to the prom folder:

git clone https://github.com/JeffersonLab/jiriaf-test-platform.git
cd jiriaf-test-platform/main/prom

2. Install Prometheus Operator

Instead of using Helm, we'll use the community-maintained manifests from the kube-prometheus project:

a. Clone the kube-prometheus repository:

git clone --depth 1 https://github.com/prometheus-operator/kube-prometheus.git /tmp/kube-prometheus

b. Copy the manifests to your current directory:

cp -R /tmp/kube-prometheus/manifests .

c. Create the Custom Resource Definitions (CRDs) and Prometheus Operator:

kubectl create -f ./manifests/setup/

d. Apply the remaining manifests:

kubectl create -f ./manifests/

e. Verify the installation:

kubectl -n monitoring get pods

3. Configure Values

Edit values.yaml to set your specific configuration:

Deployment:
  name: <project-id>
  namespace: default

PersistentVolume:
  node: jiriaf2302-control-plane
  path: /var/prom
  size: 5Gi

Prometheus:
  serviceaccount: prometheus-k8s
  namespace: monitoring

Key configurations:

Deployment.name: Used for job naming, persistent volume path, and service monitoring selection
Deployment.namespace: Specifies job namespace and namespace monitoring selection
PersistentVolume.*: Configures storage for Prometheus data
Prometheus.*: Sets Prometheus server details

Note: Only those servicemonitors with the namespace default can be monitored. To monitor additional namespaces, additional configuration is required. Refer to the Prometheus Operator documentation on customizations for details.

4. Install the Custom Prometheus Helm Chart

Run the following command, replacing <project-id> with your identifier:

helm install <project-id>-prom prom/ --set Deployment.name=<project-id>

Example:

ID=jlab-100g-nersc-ornl
helm install $ID-prom prom/ --set Deployment.name=$ID

5. Verify Deployment

Check that all components are running:

kubectl get pods -n monitoring
kubectl get pv

6. Access Grafana Dashboard

a. Find the Grafana service:

kubectl get svc -n monitoring

b. Set up port forwarding (map port 3000 from k8s cluster to host:1800):

kubectl port-forward svc/prometheus-operator-grafana -n monitoring 80:3000

c. Access Grafana at http://localhost:1800 (default credentials: admin/admin or admin/prom-operator)

7. Remove Prometheus Helm Chart (if needed)

Notice this will remove the persistent volume claim, and the data will be lost.

# 1. Remove the persistent volume claim
kubectl delete pvc -n monitoring prometheus-<project-id>-db-prometheus-<project-id>-0
# 2. Remove the Prometheus Helm Chart
helm uninstall <project-id>-prom

8. Create a Data Source on Grafana

Open Grafana Dashboard

Log in to your Grafana instance.

Navigate to Data Sources

In the left-hand menu, go to Settings → Data Sources.
Click Add data source.

Select Prometheus

From the list of available data sources, choose Prometheus.

Configure the Connection

URL: Enter the URL for your Prometheus server:

http://<name-of-svc-of-prometheus>.monitoring.svc:9090

To find the service name, use the following command:

kubectl get svc -n monitoring

Look for the name of the desired Prometheus service in the output.

Namespace and Service Context

monitoring: Represents the Kubernetes namespace where the Prometheus service is deployed.
svc: Denotes that the connection points to a service object within the namespace.

Test and Save

Scroll down and click Save & Test.
Ensure the connection is successful. If not, double-check the URL and namespace details.

Components Deployed

Prometheus Server (prometheus.yaml)
Persistent Volume for data storage (prom-pv.yaml)
Empty directory creation job (prom-create_emptydir.yaml)

Integration with Workflows

This setup is designed to monitor services and jobs created by your workflow system.

Advanced Configuration

For further customization, refer to the Helm chart templates and values.yaml. Ensure your cluster has the necessary permissions and resources for persistent volumes and Prometheus server operation.

Troubleshooting

If you encounter issues:

Check pod status: kubectl get pods -n monitoring
View pod logs: kubectl logs <pod-name> -n monitoring
Ensure persistent volume is correctly bound: kubectl get pv
Verify Prometheus configuration: kubectl get prometheus -n monitoring -o yaml

Deploy Prometheus Monitoring with Prometheus Operator

Contents

Deploy Prometheus Monitoring with Prometheus Operator

Prerequisites

Deployment Flow Chart

Deployment Steps

1. Setup Environment

2. Install Prometheus Operator

3. Configure Values

4. Install the Custom Prometheus Helm Chart

5. Verify Deployment

6. Access Grafana Dashboard

7. Remove Prometheus Helm Chart (if needed)

8. Create a Data Source on Grafana

Open Grafana Dashboard

Navigate to Data Sources

Select Prometheus

Configure the Connection

Namespace and Service Context

Test and Save

Components Deployed

Integration with Workflows

Advanced Configuration

Troubleshooting

Navigation menu

Search