-
Notifications
You must be signed in to change notification settings - Fork 5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Pose model output 39x5 #4752
Comments
cc// @DrBubbles42 |
We have not yet officially introduced support for Flutter, and it remains an item on our roadmap. You can monitor the ongoing development progress in our tracking system, where it is categorised as "Work In Progress." here. Regrettably, we are currently unable to specify a timeframe for when this support will become available. It is important to note that the repository you are following is maintained by a member of the community. To address any issues you are encountering, it would be advisable to raise the matter in the pertinent repository associated with the issue you are facing. Thank you |
@kuaashish Thank you for the details. I have been following the development of the Dart buildout, and look forward to its release. However, as I am using the TFLITE embedding, I figured it was platform agnostic and the 39x5 output was some additional data points that were a part of the model when it was "in its prime" 2 years ago. I am hearing this is not the case, and that the Pose model output was always a 33x5 array. Thank you. |
@yichunk Do you know the answer to this? |
Hello, do you know the answer to this question? |
The Pose World Landmarks can be found within the output at index 4. To access the tensors_to_pose_landmarks_and_segmentation.pbtxt file within the Pose Solution's pbtxt files, please follow the initial graph files. Within this file, examine the These tensors provide the following information:
Should you incorporate the pprint.pprint(output) function directly within your code, you will observe the subsequent output:
This observation signifies the presence of a tensor located at index 4 within the output variable, possessing dimensions of [1, 117]. This tensor, denoted as the "world_landmarks" tensor, comprises 33 keypoints, with an additional 6 for debugging purposes. Each keypoint is represented by coordinates in the (x, y, z) order, accounting for the overall size of [1, 117]. |
@kuaashish How to convert the world coordinates to the pixel coordinates of the picture? (1, 117) |
Could you kindly review the aforementioned comment and provide us with an update on whether the issue has been resolved from your end? May we proceed to mark this matter as resolved and close it? Thank you |
Thank you very much for the response. I will leave it to @DrBubbles42 to confirm whether this resolves his concerns. I was asking this on his part. |
@kuaashish Thank you for your response. This answers my question and we can close the issue. |
Thank you for your confirmation. We are now closing this issue as resolved. |
Have I written custom code (as opposed to using a stock example script provided in MediaPipe)
Yes
OS Platform and Distribution
iOS
MediaPipe version
No response
Bazel version
No response
Solution
Pose
Programming Language and version
Dart Flutter
Describe the actual behavior
Inspired by this repo, and the Mediapipe TFLITE pose solution, we are set up with an output of 39x5 instead of the expected 33x5 indicated in this document. The aforementioned repo only displays the first 33 points on the UI, but I am curious to learn about what the last 6 values are at the end of each tensor. I could not find this information anywhere in the docs.
Describe the expected behaviour
No response
Standalone code/steps you may have used to try to get what you need
No response
Other info / Complete Logs
Example Output
The text was updated successfully, but these errors were encountered: