
Updated caching strategy #576

Open
wants to merge 33 commits into master
Conversation

Contributor

@AaDalal AaDalal commented Feb 4, 2024

No description provided.

Contributor

@el-agua el-agua left a comment


lgtm

More comments would be helpful. There might be a more efficient way that doesn't require looping through all topics, but it looks fine for now.


def test_one_course(self):
    self.set_runtime_option()
    self.precompute_reviews()
Contributor


Doesn't matter for functionality, but you can also assert error/success responses with the self.out defined
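The suggested pattern can be sketched as below, with `io.StringIO` standing in for the command's captured output stream. All names here are illustrative, not the project's actual test API.

```python
import io


class PrecomputeTestCase:
    """Minimal sketch of asserting on captured command output via self.out."""

    def set_runtime_option(self):
        # The test would pass this stream as stdout to the management command.
        self.out = io.StringIO()

    def precompute_reviews(self):
        # Stand-in for invoking the precompute command with stdout=self.out.
        self.out.write("Now precomputing PCR reviews.\n")

    def test_one_course(self):
        self.set_runtime_option()
        self.precompute_reviews()
        # Beyond checking side effects, assert on the success/error text:
        assert "precomputing PCR reviews" in self.out.getvalue()
```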

backend/review/models.py (resolved)
Contributor Author

@AaDalal AaDalal left a comment


Mostly nits / naming things; only 1-2 substantive comments.

Feel free to ignore the nits if you don't have time to fix -- they aren't meant to slow you down!

"course-reviews",
"CIS-1200",
{"hello": "world"}, # No cache miss
query_params={"semester": OLD_SEMESTER},
Contributor Author


@shiva-menta we should make sure this is intended behavior

@@ -277,6 +278,7 @@ def handle(self, *args, **kwargs):

print("Recomputing Section.has_reviews...")
recompute_has_reviews()
precompute_pcr_views(True, True)
Contributor Author


nit: maybe make these keyword args

if verbose:
    print("Now precomputing PCR reviews.")

CachedReviewResponse.objects.all().update(expired=True)
Contributor Author


This is not in an atomic block, so what's the potential window during which the cached responses are entirely unavailable? Is that acceptable?

Contributor Author


NVM, I see what you're doing.

Contributor Author


expired CachedReviewResponses can still be returned from the view

Contributor


Yeah, but I think there's no reason not to have this in an atomic block either; will change.
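The mark-then-recompute pattern under discussion can be sketched with a plain dict standing in for the CachedReviewResponse table (all names here are illustrative). The key point is that entries are only *marked* expired rather than deleted, so the view can keep serving stale responses while recomputation runs; truly dead topics are only dropped at the end.

```python
def refresh_cache(cache, current_topics, compute_response):
    # Phase 1: mark everything expired (analogous to
    # CachedReviewResponse.objects.all().update(expired=True)).
    for entry in cache.values():
        entry["expired"] = True

    # Phase 2: recompute responses for live topics, un-expiring each
    # entry as it is refreshed. Stale entries stay readable meanwhile,
    # so there is no window with an empty cache.
    for topic_id in current_topics:
        cache[topic_id] = {
            "response": compute_response(topic_id),
            "expired": False,
        }

    # Phase 3: drop entries whose topics no longer exist.
    for topic_id in [t for t, e in cache.items() if e["expired"]]:
        del cache[topic_id]
```

Wrapping all three phases in a transaction additionally guarantees readers never observe the intermediate all-expired state.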

    objs_to_update.append(response_obj)
except Http404:
    logging.info(
        f"Topic returned 404 (topic_id {topic_id}, "
Contributor Author


nit: is a 404 expected behavior? If so, logging.info makes sense; if not, logging.error might be better so it looks correct in our Sentry.

Contributor


Iirc there are some topics that have a course with no reviews, and that leads to a 404 in our current implementation (can't remember if this is the exact error, but it's something similar). So yeah, I'd say it can be expected behavior for certain courses.


if topic_id in topic_set:
    try:
        has_count += 1
Contributor Author


nit: usually try to minimize the contents of the try block to what is actually expected to throw the error (for debugging/readability reasons).
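The nit can be illustrated like this, with hypothetical names standing in for the loop above: only the call that can actually raise stays inside the `try`, and bookkeeping like incrementing counters moves out.

```python
import logging


def count_cached_topics(topic_ids, topic_set, fetch_response):
    """Illustrative rewrite keeping the try block minimal (names are
    hypothetical, not the project's actual helpers)."""
    has_count = 0
    for topic_id in topic_ids:
        if topic_id not in topic_set:
            continue
        try:
            response = fetch_response(topic_id)  # the only risky call
        except KeyError:
            logging.info("Topic lookup failed (topic_id %s)", topic_id)
            continue
        has_count += 1  # safe bookkeeping stays outside the try block
    return has_count
```

This way a failure in the counter logic can never be swallowed by the except clause, and a traceback unambiguously points at the risky call.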

print("Now precomputing PCR reviews.")

CachedReviewResponse.objects.all().update(expired=True)
responses = CachedReviewResponse.objects.all()
Contributor Author


nit: I got a little confused because this is called responses. Maybe cached_responses is a better name?


CachedReviewResponse.objects.all().update(expired=True)
responses = CachedReviewResponse.objects.all()
topic_set = {response.topic_id: response for response in responses}
Contributor Author


nit: since this is a dict, and is also a mapping only for cached responses (not for all topics), it might be good to rename it.

to a JSON object storing summarized course review data (all the data frontend uses to display
reviews).
"""

Contributor Author


I think topic_id should be unique=True here (theoretically our loading script should enforce this, but I think it's good to specify here too).

topic = recent_course.topic
course_id_list = list(topic.courses.values_list("id"))
topic_id = ".".join([str(id[0]) for id in sorted(course_id_list)])
cache.set(course_code, topic_id, MONTH_IN_SECONDS)
Contributor Author


why do we set the cache manually?

Contributor Author


instead of using the decorator

Contributor Author


also we should make sure this cache is cleared every time we reimport, and/or that the TTL is short enough.

Contributor


The cache is set manually because we want control over both the course-to-topic mappings and the topic-to-response mappings. Using the decorator, since the routes are keyed by course, would only allow a course-to-response mapping, which can be a bit space-inefficient.
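The two-level layout described here can be sketched with plain dicts standing in for the Django cache (names are illustrative): many course codes map to one small topic_id entry, and the large review response is stored once per topic rather than once per course code. The topic_id scheme matches the quoted snippet above: sorted course ids joined by ".".

```python
def make_topic_id(course_ids):
    # Sorted course ids joined by ".", as in the snippet above.
    return ".".join(str(i) for i in sorted(course_ids))


# Plain dicts standing in for the two cache namespaces.
course_to_topic = {}
topic_to_response = {}


def cache_topic(course_codes, course_ids, response):
    topic_id = make_topic_id(course_ids)
    for code in course_codes:
        course_to_topic[code] = topic_id    # small entry, one per code
    topic_to_response[topic_id] = response  # big entry, one per topic


def get_cached_response(course_code):
    # Two lookups: course code -> topic id -> shared response.
    topic_id = course_to_topic.get(course_code)
    return topic_to_response.get(topic_id)
```

With a per-route decorator, by contrast, each cross-listed course code would cache its own full copy of the response.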

Contributor


Yup the relevant cache entries are cleared in the precompute_pcr_reviews script.
