Commit

doc: remove conclusion of multimodal post
jxnl committed Oct 23, 2024
1 parent 9d9de7d commit f63e1d0
Showing 1 changed file with 0 additions and 6 deletions.
6 changes: 0 additions & 6 deletions docs/blog/posts/multimodal-gemini.md
@@ -172,12 +172,6 @@ The Gemini model analyzes the video and provides structured recommendations. Her
9. **Kin no Kotte Ushi**: A shop specializing in Hida Wagyu Beef Sushi.
10. **Shirakawa-go**: A World Heritage Site in Gifu Prefecture.

-## Conclusion
-
-This example demonstrates the power of combining multimodal AI with structured output parsing. By using Gemini with Instructor, we can extract rich, structured information from video content, opening up new possibilities for travel recommendation systems, content analysis, and more.
-
-The ability to process video inputs and generate structured data outputs can be applied to various domains beyond travel, such as education, entertainment, and market research. As multimodal AI continues to evolve, we can expect even more sophisticated applications that bridge the gap between visual content and structured data.
-
## Limitations, Challenges, and Future Directions

While the current approach demonstrates the power of multimodal AI for video analysis, there are several limitations and challenges to consider: