feat(Threads): Refresh Threads access tokens automatically. #629

elisa-a-v · 2024-11-05T22:19:55Z

Resolves #627

Long-lived tokens must be at least 24 hours old before they can be refreshed, so when a new Threads channel is created, a post_save signal enqueues a job to refresh the token 2 days later. After that, each refresh enqueues the next job, delayed by as many seconds as the expires_in value from the token refresh response.

elisa-a-v · 2024-11-05T22:21:14Z

bc/channel/tasks.py

+    # Schedule the next token refresh
+    delay_seconds = (
+        expires_in - 86400
+    )  # Subtract one day to avoid expiration before the task runs
+    queue.enqueue_in(
+        timedelta(seconds=delay_seconds if delay_seconds > 0 else expires_in),
+        refresh_threads_access_token,
+        channel_pk=channel.pk,
+        retry=Retry(
+            max=settings.RQ_MAX_NUMBER_OF_RETRIES,
+            interval=settings.RQ_RETRY_INTERVAL,
+        ),
+    )


As discussed with @ERosendo, although enqueueing a job within the execution of another job isn't the most elegant solution, it's good enough for this use case since the first one will most definitely have finished executing way before the second begins.
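For reference, the delay arithmetic from the hunk above can be isolated as a small pure function, which makes the fallback branch easier to reason about. This is only a sketch: `next_refresh_delay` and `ONE_DAY_SECONDS` are hypothetical names, and the 60-day lifetime in the example call is an illustrative input, not a value taken from this PR.

```python
from datetime import timedelta

ONE_DAY_SECONDS = 86400  # the same constant the diff subtracts


def next_refresh_delay(expires_in: int) -> timedelta:
    """Return how long to wait before the next refresh job should run."""
    delay_seconds = expires_in - ONE_DAY_SECONDS
    # Fallback mirrors the diff: when a day or less remains, wait the full
    # lifetime. That schedules the job for the moment the token expires.
    return timedelta(seconds=delay_seconds if delay_seconds > 0 else expires_in)


# Example input only: a lifetime of 60 days expressed in seconds.
print(next_refresh_delay(60 * ONE_DAY_SECONDS))
```

Note that in the fallback case the job runs with no safety margin at all, so a shorter delay may be preferable there.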

mlissner · 2024-11-06T00:43:05Z

bc/channel/tasks.py

+        "grant_type": "th_refresh_token",
+        "access_token": channel.access_token,
+    }
+    response = requests.get(refresh_access_token_url, params=params)


Please don't forget a timeout. It'll haunt you later.
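A minimal sketch of what that could look like. `requests` applies no timeout by default, so a stalled server would block the worker indefinitely. The function name, the `http_get` injection point, and the timeout values here are assumptions for illustration, not code from this PR.

```python
from typing import Any, Callable, Optional

# (connect timeout, read timeout) in seconds; illustrative values only.
DEFAULT_TIMEOUT = (3.05, 10)


def refresh_access_token(
    refresh_url: str,
    access_token: str,
    http_get: Optional[Callable[..., Any]] = None,
    timeout: tuple = DEFAULT_TIMEOUT,
) -> dict:
    """Call the token refresh endpoint and return the decoded JSON body."""
    if http_get is None:
        import requests  # imported lazily so a fake can be injected in tests

        http_get = requests.get
    params = {
        "grant_type": "th_refresh_token",
        "access_token": access_token,
    }
    # Passing `timeout` makes requests raise instead of waiting forever
    # when the server accepts the connection but never answers.
    response = http_get(refresh_url, params=params, timeout=timeout)
    response.raise_for_status()
    return response.json()
```

With a timeout in place, a hung request surfaces as an exception, which the job's existing `Retry` settings can then handle.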

mlissner

This looks generally OK, but I think we need something else to trigger the refresh, because this creates an ongoing chain of refreshes that's bound to break eventually. If just one refresh fails for whatever reason (say Threads is down?), we'll never refresh again and we'll be sad.

Could you check the token's expiration every time you do a post and trigger a refresh if you're getting close to the deadline? You could get fancy and put a semaphore in redis to prevent two or more simultaneous refresh tasks, but I'm not sure it matters. Ideally we're sending one or two posts per hour, say, and so you get:

- New post
- Refresh task queued
- Refresh task completed
- Next post

You could have two or more posts at the same time, I suppose, which would lead to:

- New post
- First refresh task queued
- Next post
- Second refresh task queued
- First refresh complete
- Second refresh complete

That's fine too, right?
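If the duplicate refresh did turn out to matter, the Redis semaphore could be as small as a `SET` with `nx` and `ex`. The sketch below assumes a redis-py style client is passed in; `refresh_once`, the key name, and the TTL are hypothetical.

```python
from typing import Callable

LOCK_TTL_SECONDS = 60  # hypothetical; should exceed the time one refresh takes


def refresh_once(redis_client, channel_pk: int, do_refresh: Callable[[], None]) -> bool:
    """Run do_refresh() unless another worker already holds the lock.

    Returns True if this call performed the refresh, False if it was skipped.
    """
    lock_key = f"threads:refresh-lock:{channel_pk}"  # hypothetical key name
    # SET with nx=True only succeeds when the key is absent, so a single
    # caller acquires the lock; ex= lets it expire if the holder crashes.
    acquired = redis_client.set(lock_key, "1", nx=True, ex=LOCK_TTL_SECONDS)
    if not acquired:
        return False
    try:
        do_refresh()
    finally:
        redis_client.delete(lock_key)
    return True
```

One caveat of this simple form: if a refresh outlives the TTL, the `delete` can remove a lock that another worker has since acquired. For one or two posts per hour that is probably acceptable.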

elisa-a-v · 2024-11-06T23:46:12Z

We did consider an approach like the one you're suggesting but the only way to get the tokens' expiration is via token retrieval, so we have no way of getting the expiration from the API when publishing posts.

@ERosendo and I discussed some alternatives that achieve something similar by storing the expiration date somewhere (so we can check it every time we post), either in our database or in Redis. Both have their pros and cons:

Storing in the database:
- Pro: we can tweak the script just a little so it also provides the expiration date, which the admin can use right away when creating the channel.
- Con: we need to either add a column to the Channel model (which only Threads channels would use, so not ideal), or create a new model with a FK to the channel (an entire model just for this, so I guess also not ideal? Not too bad imo, but def not very nice).

Storing in Redis:
- Pro: no need to change the database schema.
- Cons:
  - More volatile? We'd have to handle persistence.
  - More changes required: we would probably need to run the script as a command within Django instead of a standalone Python script.

Another alternative entirely could be to keep the chain of refreshes, and add redundancy by using a lazy refresh on post if we get an authentication error from Threads API. Of course this also has its pros and cons:

- Pro: minimal changes? No need to store anything new.
- Cons:
  - More API calls could potentially be an issue, although the limit seems to be >48,000 API calls in a 24h period, so I think we're fine.
  - Better error handling would be needed.
  - UX could be affected if posts take too long to appear on Threads, but I'm not sure it would be too bad? Probably a delay of less than a few minutes, which sounds reasonable to me for a social media bot, but you tell me.
  - The race condition seems even worse with this one? Not entirely sure why.
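To make the lazy-refresh alternative concrete, here is a rough sketch of the control flow. Everything in it is hypothetical: `AuthenticationError` stands in for whatever error the Threads client would raise, and `publish` and `refresh` are placeholder callables rather than existing functions in this codebase.

```python
from typing import Callable


class AuthenticationError(Exception):
    """Hypothetical error raised when the API rejects the access token."""


def publish_with_lazy_refresh(
    publish: Callable[[str], str],
    refresh: Callable[[str], str],
    access_token: str,
) -> tuple:
    """Publish a post, refreshing the token and retrying once on auth failure.

    Returns (publish result, token that ended up being used).
    """
    try:
        return publish(access_token), access_token
    except AuthenticationError:
        # Token rejected: refresh it, then retry exactly once so that a
        # second failure surfaces to the caller instead of looping.
        new_token = refresh(access_token)
        return publish(new_token), new_token
```

It would be worth checking the Threads documentation on whether an already expired token can still be refreshed. If it cannot, this fallback would only cover auth failures other than expiry.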

I personally like the redis option because it seems more elegant and it's something I'd like to learn about, but the database option with a new model seems like the safest bet? It's probably best for persistence and reliability. The redundancy alternative is not my favorite but it seems simpler to implement at first glance.

What do you think?

mlissner · 2024-11-07T00:24:52Z

Redis is plenty persistent for something like this, and you could have a rule that says, "If the key is missing, just refresh the key." If you do that, I think you'd be good to go without having to fiddle with the models.

We do have the redis CLI installed and the redis Python module is available, so you wouldn't need to convert it to a Django command to just make/check a key.
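A sketch of that rule, assuming a redis-py style client is passed in. The function names, the key name, and the one-day margin are hypothetical and only meant to show the shape of the check.

```python
import time
from typing import Optional

REFRESH_MARGIN_SECONDS = 86400  # hypothetical: refresh when under a day remains


def expiry_key(channel_pk: int) -> str:
    return f"threads:token-expiry:{channel_pk}"  # hypothetical key name


def needs_refresh(redis_client, channel_pk: int, now: Optional[float] = None) -> bool:
    """Return True if the stored expiry is missing or close to the deadline."""
    now = time.time() if now is None else now
    raw = redis_client.get(expiry_key(channel_pk))
    if raw is None:
        # The rule described above: a missing key means "just refresh".
        return True
    # redis-py returns bytes by default; float() accepts bytes as well as str.
    return float(raw) - now < REFRESH_MARGIN_SECONDS


def record_expiry(
    redis_client, channel_pk: int, expires_in: int, now: Optional[float] = None
) -> None:
    """Store the absolute expiry time (Unix seconds) after a successful refresh."""
    now = time.time() if now is None else now
    redis_client.set(expiry_key(channel_pk), str(now + expires_in))
```

Calling `needs_refresh` before each post and `record_expiry` after each successful refresh would replace the chained jobs, and a lost key costs one extra refresh at most.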

… logic
- Eliminated chained enqueued tasks used for token refreshing in Threads integration.
- Moved token validation and refresh logic into the Channel model's validate_access_token method.
- Adjusted ThreadsAPI methods to handle token expiration and refreshing internally.
- Implemented Redis-based locking mechanism to prevent concurrent token refreshes.

feat(Threads): Refresh Threads access tokens automatically.

a7cb896

elisa-a-v requested review from mlissner and ERosendo November 5, 2024 22:19

Merge branch 'main' into threads-refresh-access-token

68a955e

mlissner reviewed Nov 6, 2024

mlissner requested changes Nov 6, 2024

elisa-a-v and others added 2 commits November 6, 2024 20:52

style(channel_tasks): Improve readability

69ec643

Co-authored-by: Mike Lissner <[email protected]>

fix(Threads): Add timeout to request to refresh access token

3c74070

elisa-a-v added 4 commits November 8, 2024 03:06

feat(threads_script): Add timeouts to requests

ef3f923

docs(Threads): Add docstrings to methods related to token validation

a16c938

docs(Threads): Add docstrings to RefreshableBaseAPIConnector

fa9e7bf
