- [R] Going further under Grounded-Segment-Anything: integrating Whisper and ChatGPT https://github.com/IDEA-Research/Grounded-Segment-Anything 8 comments machinelearning
- [P] Grounded-Segment-Anything: Zero-shot Detection and Segmentation https://github.com/IDEA-Research/Grounded-Segment-Anything 7 comments machinelearning
Linking pages
- GitHub - ShaShekhar/aaiela https://github.com/ShaShekhar/aaiela 30 comments
- GitHub - sail-sg/EditAnything https://github.com/sail-sg/EditAnything 24 comments
- GitHub - IDEA-Research/GroundingDINO: The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection" https://github.com/IDEA-Research/GroundingDINO 6 comments
- GitHub - autodistill/autodistill: Images to inference with no labeling (use foundation models to train supervised models) https://github.com/autodistill/autodistill 5 comments
- Grounded-Segment-Anything/playground/ImageBind_SAM at main · IDEA-Research/Grounded-Segment-Anything · GitHub https://github.com/IDEA-Research/Grounded-Segment-Anything/tree/main/playground/ImageBind_SAM 4 comments
- GitHub - ycheng517/tabletop-handybot: A low-cost AI powered robotic arm assistant https://github.com/ycheng517/tabletop-handybot 1 comment
- GitHub - UX-Decoder/Segment-Everything-Everywhere-All-At-Once https://github.com/UX-Decoder/Segment-Everything-Everywhere-All-At-Once 0 comments
- GitHub - Adamdad/Awesome-ComposableAI: A curated list of Composable AI methods: Building AI system by composing modules. https://github.com/Adamdad/Awesome-ComposableAI 0 comments
- GitHub - xinyu1205/Recognize_Anything-Tag2Text: Code for the Recognize Anything Model and Tag2Text Model https://github.com/xinyu1205/Recognize_Anything-Tag2Text 0 comments
- GitHub - haotian-liu/LLaVA: Visual Instruction Tuning: Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities. https://github.com/haotian-liu/LLaVA 0 comments
- GitHub - asFeng/d-edit: The implementation of "An item is Worth a Prompt: Versatile Image Editing with Disentangled Control" https://github.com/asFeng/d-edit 0 comments
Linked pages
- GitHub - microsoft/visual-chatgpt: VisualChatGPT https://github.com/microsoft/visual-chatgpt 229 comments
- Segment Anything | Meta AI https://segment-anything.com/ 157 comments
- GitHub - openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision https://github.com/openai/whisper/ 126 comments
- LLaVA https://llava-vl.github.io/ 54 comments
- GitHub - sail-sg/EditAnything https://github.com/sail-sg/EditAnything 24 comments
- Segment Anything | Meta AI https://segment-anything.com/demo 9 comments
- GitHub - IDEA-Research/GroundingDINO: The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection" https://github.com/IDEA-Research/GroundingDINO 6 comments
- GitHub - autodistill/autodistill: Images to inference with no labeling (use foundation models to train supervised models) https://github.com/autodistill/autodistill 5 comments
- GitHub - CompVis/stable-diffusion: A latent text-to-image diffusion model https://github.com/CompVis/stable-diffusion 4 comments
- Recognize Anything: A Strong Image Tagging Model https://recognize-anything.github.io/ 4 comments
- Start Locally | PyTorch https://pytorch.org/get-started/locally/ 3 comments
- [2304.08485] Visual Instruction Tuning https://arxiv.org/abs/2304.08485 1 comment
- GitHub - roboflow/notebooks: Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM. https://github.com/roboflow/notebooks 1 comment
- GitHub - ycheng517/tabletop-handybot: A low-cost AI powered robotic arm assistant https://github.com/ycheng517/tabletop-handybot 1 comment
- GitHub - salesforce/LAVIS: LAVIS - A One-stop Library for Language-Vision Intelligence https://github.com/salesforce/LAVIS 0 comments
- [2303.04671] Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models https://arxiv.org/abs/2303.04671 0 comments
- GitHub - UX-Decoder/Segment-Everything-Everywhere-All-At-Once https://github.com/UX-Decoder/Segment-Everything-Everywhere-All-At-Once 0 comments
- GitHub - facebookresearch/segment-anything: The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model. https://github.com/facebookresearch/segment-anything 0 comments
- [2304.06718] Segment Everything Everywhere All at Once https://arxiv.org/abs/2304.06718 0 comments
- GitHub - ttengwang/Caption-Anything: Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://github.com/ttengwang/Caption-Anything 0 comments