 Why are my training results so far from the experimental results (Kitchen Dataset)