Hey! Really appreciate the reply. I've just raised a GitHub issue like you said, with more info: https://github.com/xenova/transformers.js/issues/585
// npm i @xenova/transformers
import { SamModel, AutoProcessor, RawImage } from '@xenova/transformers';
// Load model and processor
const model = await SamModel.from_pretrained('Xenova/slimsam-77-uniform');
const processor = await AutoProcessor.from_pretrained('Xenova/slimsam-77-uniform');
// Prepare image and input points
const img_url = 'https://huggingface.co/datasets/Xenova/transformers.js-docs/resolve/main/corgi.jpg';
const raw_image = await RawImage.read(img_url);
const input_points = [[[340, 250]]];
// Process inputs and perform mask generation
const inputs = await processor(raw_image, input_points);
const outputs = await model(inputs);
// Post-process masks
const masks = await processor.post_process_masks(outputs.pred_masks, inputs.original_sizes, inputs.reshaped_input_sizes);
console.log(masks);
// Visualize the mask
const image = RawImage.fromTensor(masks[0][0].mul(255));
image.save('mask.png');
Hey, quick question on this. I've been playing around with it and loving it. I wanted to know: if I wanted to take Meta's approach and compute the image embeddings server-side, would I be able to use the normal sam-vit-base on the server alongside Xenova/sam-vit-base on the frontend for decoding?
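For reference, here's a rough sketch of what that split looks like using transformers.js on both ends. The embedding step below runs locally via model.get_image_embeddings and just stands in for whatever the server would do; whether embeddings produced by the Python sam-vit-base checkpoint can be shipped down and fed straight into the JS mask decoder (matching preprocessing, dtype, and tensor shapes) is an assumption I haven't verified.
// npm i @xenova/transformers
import { SamModel, AutoProcessor, RawImage, Tensor } from '@xenova/transformers';
// Load model and processor (the mask decoder is small; only the image encoder is heavy)
const model = await SamModel.from_pretrained('Xenova/slimsam-77-uniform');
const processor = await AutoProcessor.from_pretrained('Xenova/slimsam-77-uniform');
// Step 1 (run once, e.g. server-side): encode the image into embeddings
const img_url = 'https://huggingface.co/datasets/Xenova/transformers.js-docs/resolve/main/corgi.jpg';
const raw_image = await RawImage.read(img_url);
const image_inputs = await processor(raw_image);
const image_embeddings = await model.get_image_embeddings(image_inputs);
// Step 2 (client-side, repeated per prompt): decode masks from the cached embeddings.
// Points are scaled from original image coordinates into the resized input space,
// since we're bypassing the processor's prompt handling here.
const [oh, ow] = image_inputs.original_sizes[0];
const [rh, rw] = image_inputs.reshaped_input_sizes[0];
const points = [[340 * rw / ow, 250 * rh / oh]];
const labels = [1n]; // 1 = foreground point
const input_points = new Tensor('float32', points.flat(Infinity), [1, 1, points.length, 2]);
const input_labels = new Tensor('int64', labels, [1, 1, labels.length]);
const outputs = await model({ ...image_embeddings, input_points, input_labels });
const masks = await processor.post_process_masks(outputs.pred_masks, image_inputs.original_sizes, image_inputs.reshaped_input_sizes);
console.log(masks);
The nice part of this split is that the expensive encoder pass happens once per image, and each new click only re-runs the lightweight decoder against the cached embeddings, which is essentially what Meta's demo does.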