huggingface · elvinagam · Mar 6, 2025
diff --git a/chapters/en/chapter12/3.mdx b/chapters/en/chapter12/3.mdx
@@ -11,9 +11,10 @@ In the next chapter, we will build on this knowledge and implement GRPO in pract
 The initial goal of the paper was to explore whether pure reinforcement learning could develop reasoning capabilities without supervised fine-tuning. 
 
 <Tip>
-Up until that point, all the popular LLMs required some supervised fine-tuning, which we explored in [chapter 11](/chapters/en/chapter11/1).
+Up until that point, all the popular LLMs required some supervised fine-tuning, which we explored in <a href="../chapter11/1.mdx">chapter 11</a>.
 </Tip>
 
+
 ## The Breakthrough 'Aha' Moment
 
 ![The 'Aha Moment'](https://huggingface.co/reasoning-course/images/resolve/main/grpo/9.png)