Repository Details
HelloGitHub Rating
0 ratings
Apple's Open-Source Multimodal Large Language Model
Stars: 8.4k
Chinese: No
Language: Python
Active: Yes
Contributors: 1
Issues: 7
Organization: Yes
Latest: None
Forks: 495
License: None
Ferret, Apple's open-source multimodal LLM, analyzes and recognizes the content of images and can draw bounding boxes on them. Given an image, you can ask questions about what it shows, and Ferret analyzes the image to generate an answer.
