Observability #40

nikita-volkov · 2024-02-14T21:17:00Z

Adds a stream of events describing the significant state changes in the pool for monitoring purposes.

nikita-volkov · 2024-02-14T21:21:21Z

@robx Hey Rob! Can you please give a feedback on this?

steve-chavez · 2024-02-15T05:27:43Z

Many thanks for this @nikita-volkov! Just started experimenting on PostgREST/postgrest#3229

Do you think hasql-pool could also expose metrics like "total waiters"? At one point (when hasql-pool used resource-pool) we had that one and other metrics, see PostgREST/postgrest#2129.

steve-chavez · 2024-02-15T05:30:38Z

library/Hasql/Pool.hs

+  -- If the action is not lightweight, it's recommended to use intermediate bufferring via channels like TBQueue.
+  -- E.g., if the action is @'atomically' . 'writeTBQueue' yourQueue@, then reading from it and processing can be done on a separate thread.


When could this happen? When the pool size is big and there are lots of connections?

I'm talking about the user supplied action here. It should be quick since it will block the pool processes.

steve-chavez · 2024-02-15T05:33:54Z

library/Hasql/Pool/Observation.hs

+  = ConnectionEstablishedObservation UUID
+  | AttemptingToConnectObservation UUID
+  | FailedToConnectObservation UUID (Maybe ByteString)
+  | ConnectionReleasedObservation UUID ReleaseReason


Q: Why does the UUID need to be generated internally? (just wondering if it can be left out for simplifying the code)

It's for identifying the connection. Lets the user isolate events on a particular connection

nikita-volkov · 2024-02-15T16:17:07Z

Many thanks for this @nikita-volkov! Just started experimenting on PostgREST/postgrest#3229

Great! Thanks! Looking forward for feedback!

Do you think hasql-pool could also expose metrics like "total waiters"? At one point (when hasql-pool used resource-pool) we had that one and other metrics, see PostgREST/postgrest#2129.

Could you clarify on what you imply by "total waiters"? Unfortunately I couldn't derive that from the link.

steve-chavez · 2024-02-15T20:16:53Z

Could you clarify on what you imply by "total waiters"? Unfortunately I couldn't derive that from the link.

So the motivation is to be able to prevent poolAcquisitionTimeout, on the postgREST side this results in an API error (ref).

For this, I understand we could use the following metrics:

pending_requests (same idea as total waiters) https://opentelemetry.io/docs/specs/semconv/database/database-metrics/#metric-dbclientconnectionspending_requests
- If this keeps increasing it would be an indicative that a pool acquisition timeout will happen. So an admin can adjust the pool size or tune the queries to prevent it.
- I've also seen this named as "concurrentWaiters: Number of waiting threads" (ref)
pool wait time https://opentelemetry.io/docs/specs/semconv/database/database-metrics/#metric-dbclientconnectionswait_time
- Same idea as above, if it keeps increasing then a pool timeout will happen.

nikita-volkov · 2024-02-15T20:47:25Z

Thanks for the references! I'll see what can be done.

nikita-volkov · 2024-02-15T21:21:43Z

I've taken a look. Seems like both of those metrics are already computable without the features involved in this PR.

executeObservedSession :: Pool -> Monitor -> Session -> IO ()
executeObservedSession pool monitor session = do
  Monitor.incGauge monitor "db.client.connections.pending_requests"
  startTime <- getCurrentTime
  Pool.use pool do
    liftIO $ do
      endTime <- getCurrentTime
      Monitor.observeHistogram monitor "db.client.connections.wait_time" (diffUTCTime endTime startTime)
      Monitor.decGauge monitor "db.client.connections.pending_requests"
    session

@steve-chavez Is that correct?

steve-chavez · 2024-02-15T21:55:41Z

@nikita-volkov You're right! Thanks for the snippet!

steve-chavez · 2024-02-15T21:57:55Z

Monitor.observeHistogram monitor "db.client.connections.wait_time" (diffUTCTime endTime startTime)

Curious, where does observeHistogram come from? I found decGauge on hoogle but not observeHistogram.

nikita-volkov · 2024-02-17T09:34:47Z

Ah. It was just pseudocode. I was implying something like this.

nikita-volkov · 2024-02-26T19:21:00Z

Thanks for cooperation guys! It's released.

nikita-volkov added 2 commits February 14, 2024 23:36

State the goals

4f18b6c

Implement observability

edc33d2

steve-chavez mentioned this pull request Feb 15, 2024

feat: log connection pool events on log-level=info PostgREST/postgrest#3229

Merged

steve-chavez reviewed Feb 15, 2024

View reviewed changes

steve-chavez mentioned this pull request Feb 17, 2024

refactor: add observation module PostgREST/postgrest#3232

Merged

nikita-volkov added 3 commits February 22, 2024 21:02

Move to a status model

fc48a10

Fix export

4a29071

Correct refs

7a0e620

nikita-volkov merged commit 92a6ed9 into master Feb 25, 2024
3 checks passed

steve-chavez mentioned this pull request Apr 26, 2024

Add pool checkout to Server-Timing PostgREST/postgrest#3442

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Observability #40

Observability #40

nikita-volkov commented Feb 14, 2024

nikita-volkov commented Feb 14, 2024

steve-chavez commented Feb 15, 2024

steve-chavez Feb 15, 2024

nikita-volkov Feb 15, 2024

steve-chavez Feb 15, 2024

nikita-volkov Feb 15, 2024

nikita-volkov commented Feb 15, 2024

steve-chavez commented Feb 15, 2024

nikita-volkov commented Feb 15, 2024

nikita-volkov commented Feb 15, 2024

steve-chavez commented Feb 15, 2024

steve-chavez commented Feb 15, 2024

nikita-volkov commented Feb 17, 2024

nikita-volkov commented Feb 26, 2024

		-- If the action is not lightweight, it's recommended to use intermediate bufferring via channels like TBQueue.
		-- E.g., if the action is @'atomically' . 'writeTBQueue' yourQueue@, then reading from it and processing can be done on a separate thread.

Observability #40

Observability #40

Conversation

nikita-volkov commented Feb 14, 2024

nikita-volkov commented Feb 14, 2024

steve-chavez commented Feb 15, 2024

steve-chavez Feb 15, 2024

Choose a reason for hiding this comment

nikita-volkov Feb 15, 2024

Choose a reason for hiding this comment

steve-chavez Feb 15, 2024

Choose a reason for hiding this comment

nikita-volkov Feb 15, 2024

Choose a reason for hiding this comment

nikita-volkov commented Feb 15, 2024

steve-chavez commented Feb 15, 2024

nikita-volkov commented Feb 15, 2024

nikita-volkov commented Feb 15, 2024

steve-chavez commented Feb 15, 2024

steve-chavez commented Feb 15, 2024

nikita-volkov commented Feb 17, 2024

nikita-volkov commented Feb 26, 2024