Apache Mesos

Monitor Apache Mesos Virtualization Resource Allocation Layer

❗️

This source has been deprecated

observIQ is in the process of transitioning a subset of BindPlane's monitoring capabilities to the observIQ OpenTelemetry Collector. As a result, this Source is no longer publicly available in BindPlane. If you need access to this Source, please reach out to our support via chat or via [email protected].

Data Collection Setup

Metrics are collected via the Operator HTTP API on the Mesos Master server.

Network Requirements

Port: 5050 (TCP) to the Mesos Master Server

Least Privileged User

Mesos can be configured with or without authentication.

For more information on setting up authentication which will be used for the HTTP API see: http://mesos.apache.org/documentation/latest/authentication/

Supported Versions

Mesos: 1.1+

Connection Parameters

NameRequired?Description
HostRequiredThe Master Node of the Apache Mesos Cluster to connect to.
PortThe port for communication to Apache Mesos REST API.
UsernameRequired
PasswordRequired
SSL ConfigurationThe SSL mode to use when connecting to the target. Can be configured to not use SSL (No SSL), use SSL but do not verify the target's certificate (No Verify), and use SSL and verify the target's certificate (Verify).
Connection Timeout (seconds)The number of seconds to allow for connecting to the target.

Metrics

Agent

NameDescription
ActiveActive
Allocated CPU CountAllocated CPU Count
Allocated CPU Ratio (%)Allocated CPU Ratio
Allocated Disk Space (Megabytes)Allocated Disk Space
Allocated Disk Space Ratio (%)Allocated Disk Space Ratio
Allocated GPU countAllocated GPU count
Allocated GPUs (%)Allocated GPUs
Allocated Memory (Megabytes)Allocated Memory
Allocated Memory Ratio (%)Allocated Memory Ratio
Allocated Revocable CPU CountAllocated Revocable CPU Count
Allocated Revocable CPUs (%)Allocated Revocable CPUs
Allocated Revocable Disk Space (Megabytes)Allocated Revocable Disk Space
Allocated Revocable Disk Space Ratio (%)Allocated Revocable Disk Space Ratio
Allocated Revocable GPU CountAllocated Revocable GPU Count
Allocated Revocable GPUs (%)Allocated Revocable GPUs
Allocated Revocable Memory (Megabytes)Allocated Revocable Memory
Allocated Revocable Memory Ratio (%)Allocated Revocable Memory Ratio
Available Disk Space (Megabytes)Available Disk Space
Available Memory (Megabytes)Available Memory
Container Destroy ErrorsContainer Destroy Errors
CPU CountCPU Count
CPU Load 15 MinutesCPU Load 15 Minutes
CPU Load 1 MinuteCPU Load 1 Minute
CPU Load 5 MinutesCPU Load 5 Minutes
Failed Task FetchesFailed Task Fetches
Failed Tasks CountFailed Tasks Count
Fetcher Total Cache Size (Bytes)Fetcher Total Cache Size
Fetcher Used Cache Size (Bytes)Fetcher Used Cache Size
Finished Tasks CountFinished Tasks Count
Free Memory (Bytes)Free Memory
Garbage Collection Path Removal FailuresGarbage Collection Path Removal Failures
Garbage Collection Path Removal SuccessesGarbage Collection Path Removal Successes
Garbage Collection Pending Path RemovalsGarbage Collection Pending Path Removals
GPU CountGPU Count
HostnameHostname
IDID
Killed Tasks CountKilled Tasks Count
Lost Tasks CountLost Tasks Count
PIDPID
PortPort
RegisteredRegistered
Registered Time (Nanoseconds)Registered Time
Revocable CPU countRevocable CPU count
Revocable Disk Space (Megabytes)Revocable Disk Space
Revocable GPU CountRevocable GPU Count
Revocable Memory (Megabytes)Revocable Memory
Running Tasks CountRunning Tasks Count
Staging Tasks CountStaging Tasks Count
Starting Tasks CountStarting Tasks Count
Successful Task FetchesSuccessful Task Fetches
Total Memory (Bytes)Total Memory
Uptime (Seconds)Uptime
VersionVersion

Cluster

NameDescription
Active Agents CountActive Agents Count
Active Frameworks CountActive Frameworks Count
Agents Canceled UnreachableAgents Canceled Unreachable
Agents Marked UnreachableAgents Marked Unreachable
Agents RejoinedAgents Rejoined
Agents RemovedAgents Removed
Agents ReregisteredAgents Reregistered
Agents Scheduled UnreachableAgents Scheduled Unreachable
Allocated CPU CountAllocated CPU Count
Allocated CPUs (%)Allocated CPUs
Allocated Disk Space (Megabytes)Allocated Disk Space
Allocated Disk Space Ratio (%)Allocated Disk Space Ratio
Allocated GPU CountAllocated GPU Count
Allocated GPUs (%)Allocated GPUs
Allocated Memory (Megabytes)Allocated Memory
Allocated Memory Ratio (%)Allocated Memory Ratio
Allocated Revocable CPUs (%)Allocated Revocable CPUs
Allocated Revocable CPUs CountAllocated Revocable CPUs Count
Allocated Revocable Disk Space (%)Allocated Revocable Disk Space
Allocated Revocable GPU CountAllocated Revocable GPU Count
Allocated Revocable GPUs (%)Allocated Revocable GPUs
Allocated Revocable Memory (Megabytes)Allocated Revocable Memory
Allocated Revocable Memory Ratio (%)Allocated Revocable Memory Ratio
Available Disk Space (Megabytes)Available Disk Space
Available Memory (Megabytes)Available Memory
Connected Agents CountConnected Agents Count
Connected Frameworks CountConnected Frameworks Count
Disconnected Agents CountDisconnected Agents Count
Disconnected Frameworks CountDisconnected Frameworks Count
Draining machinesDraining machines
Failed TasksFailed Tasks
Finished TasksFinished Tasks
Inactive Agents CountInactive Agents Count
Inactive Frameworks CountInactive Frameworks Count
Invalid TasksInvalid Tasks
Killed TasksKilled Tasks
Lost TasksLost Tasks
Maintenance WindowsMaintenance Windows
Outstanding Resource Offers CountOutstanding Resource Offers Count
Revocable CPUs CountRevocable CPUs Count
Revocable Disk Space (Megabytes)Revocable Disk Space
Revocable GPU CountRevocable GPU Count
Revocable Memory (Megabytes)Revocable Memory
Running TasksRunning Tasks
Staging TasksStaging Tasks
Starting TasksStarting Tasks
Tasks Being KilledTasks Being Killed
Total CPU CountTotal CPU Count
Total GPU CountTotal GPU Count
Unreachable TasksUnreachable Tasks
Used Revocable Disk Space (Megabytes)Used Revocable Disk Space

Container

NameDescription
Agent IDAgent ID
CPU LimitCPU Limit
CPU System Time (Seconds)CPU System Time
CPU User Time (Seconds)CPU User Time
Executor IDExecutor ID
Executor NameExecutor Name
Framework IDFramework ID
IDID
Memory Limit (Bytes)Memory Limit
Memory RSS (Bytes)Memory RSS

Executor

NameDescription
Agent IDAgent ID
Command ShellCommand Shell
Command valueCommand value
Framework IDFramework ID
IDID

Framework

NameDescription
ActiveActive
Allocated CPUAllocated CPU
Allocated Disk (Megabytes)Allocated Disk
Allocated Memory (Megabytes)Allocated Memory
ConnectedConnected
Failover Timeout (Milliseconds)Failover Timeout
HostnameHostname
IDID
NameName
RecoveredRecovered
UserThe OS Account that is used by the Agent to run the Executors of this Framework.
Web UI URLWeb UI URL

Master

NameDescription
Build dateBuild date
Build timeBuild time
Build UserBuild User
CPU Load 15 MinutesCPU Load 15 Minutes
CPU Load 1 MinuteCPU Load 1 Minute
CPU Load 5 MinutesCPU Load 5 Minutes
Elected MasterElected Master
Elected Time (Seconds)Elected Time
Free memory (Bytes)Free memory
Git SHAGit SHA
Git TagGit Tag
HealthyHealthy
HostnameHostname
IDID
IP AddressIP Address
Log LevelLog Level
PIDPID
PortPort
Start Time (Seconds)Start Time
Total CPU CountTotal CPU Count
Total memory (Bytes)Total memory
Uptime (Seconds)The current time in seconds this master node has been running. A value consistently below 60 seconds indicates a flapping master node.
VersionVersion

Quota

NameDescription
CPUCPU
Disk (Megabytes)Disk
Memory (Megabytes)Memory
PrincipalPrincipal
RoleRole

Role

NameDescription
CPUCPU
Disk (Megabytes)Disk
Memory (Megabytes)Memory
NameName
Ports EndPorts End
Ports StartPorts Start
WeightWeight

Task

NameDescription
Agent IDAgent ID
CPUThe amount of CPU allocated to this task. The units are seconds of CPU Time per second of wall clock time.
Disk (Megabytes)Disk
Framework IDFramework ID
IDID
Memory (Megabytes)Memory
NameName
StateState
Status Update StateStatus Update State
Status Update UUIDStatus Update UUID

Weight

NameDescription
AmountAmount
RoleRole