Repository Details
Stars: 8.5k
Chinese: No
Language: Python
Active: Yes
Contributors: 1
Issues: 7
Organization: Yes
Latest: None
Forks: 503
License: None
Ferret is an open-source multimodal large language model (LLM) from Apple that can analyze images, recognize what they contain, and draw bounding boxes. You supply an image and ask a question about its content; the model then analyzes the image and generates an answer.