Skip to content

Commit 7ff1cc7

Browse files
committed
docs: update images
1 parent 04d60af commit 7ff1cc7

File tree

1 file changed

+12
-12
lines changed

1 file changed

+12
-12
lines changed

README.md

Lines changed: 12 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -14,10 +14,10 @@ Based on [ggml](https://github.com/ggml-org/ggml) similar to the [llama.cpp](htt
1414

1515
| Model | Task | Backends |
1616
| :------------------------------------------- | :--------------- | :---------- |
17-
| [**MobileSAM**](#segment-anything-model-sam) | Segmentation | CPU, Vulkan |
17+
| [**MobileSAM**](#mobilesam) | Segmentation | CPU, Vulkan |
1818
| [**BiRefNet**](#birefnet) | Segmentation | CPU, Vulkan |
1919
| [**MI-GAN**](#mi-gan) | Inpainting | CPU, Vulkan |
20-
| [**ESRGAN**](#esrgan) | Super-resolution | CPU, Vulkan |
20+
| [**ESRGAN**](#real-esrgan) | Super-resolution | CPU, Vulkan |
2121
| [_Implement a model [**Guide**]_]() | | |
2222

2323
## Get Started
@@ -30,7 +30,7 @@ Get the library and executables:
3030

3131
Let's use MobileSAM to generate a segmentation mask of the plushy on the right by passing in a box describing its approximate location.
3232

33-
<img alt="Example image showing box prompt at pixel location (420, 120) -> (650, 430), and the output mask" src="docs/media/example-sam-coords.jpg" width="400">
33+
<img width="400" height="256" alt="Example image showing box prompt at pixel location (420, 120) - (650, 430), and the output mask" src="https://github.com/user-attachments/assets/0b90ad96-c7d2-4c4c-b028-699433cef704" />
3434

3535
You can download the model and input image here: [MobileSAM-F16.gguf](https://huggingface.co/Acly/MobileSAM-GGUF/resolve/main/MobileSAM-F16.gguf) | [input.jpg](docs/media/input.jpg)
3636

@@ -71,39 +71,39 @@ with your existing data sources and infrastructure.
7171

7272
## Models
7373

74-
### MobileSAM
74+
#### MobileSAM
7575

76-
<img src="docs/media/example-sam.jpg" width="400">
76+
<img width="400" height="256" alt="example-sam" src="https://github.com/user-attachments/assets/9c0fe151-9990-4bb1-b954-7caff560b110" />
7777

7878
[Model download](https://huggingface.co/Acly/MobileSAM-GGUF/tree/main) | [Paper (arXiv)](https://arxiv.org/pdf/2306.14289.pdf) | [Repository (GitHub)](https://github.com/ChaoningZhang/MobileSAM) | [Segment-Anything-Model](https://segment-anything.com/) | License: Apache-2
7979

8080
```sh
8181
vision-cli sam -m MobileSAM-F16.gguf -i input.png -p 300 200 -o mask.png --composite comp.png
8282
```
8383

84-
### BiRefNet
84+
#### BiRefNet
8585

86-
<img src="docs/media/example-birefnet.png" width="400">
86+
<img width="400" height="256" alt="example-birefnet" src="https://github.com/user-attachments/assets/6fce086d-cb89-4717-92a6-9f4a20532b3c" />
8787

8888
[Model download](https://huggingface.co/Acly/BiRefNet-GGUF/tree/main) | [Paper (arXiv)](https://arxiv.org/pdf/2401.03407) | [Repository (GitHub)](https://github.com/ZhengPeng7/BiRefNet) | License: MIT
8989

9090
```sh
9191
vision-cli birefnet -m BiRefNet-lite-F16.gguf -i input.png -o mask.png --composite comp.png
9292
```
9393

94-
### MI-GAN
94+
#### MI-GAN
9595

96-
<img src="docs/media/example-migan.jpg" width="400">
96+
<img width="400" height="256" alt="example-migan" src="https://github.com/user-attachments/assets/cadf1994-7677-4822-94e5-a2ee6c07621f" />
9797

9898
[Model download](https://huggingface.co/Acly/MIGAN-GGUF/tree/main) | [Paper (thecvf.com)](https://openaccess.thecvf.com/content/ICCV2023/papers/Sargsyan_MI-GAN_A_Simple_Baseline_for_Image_Inpainting_on_Mobile_Devices_ICCV_2023_paper.pdf) | [Repository (GitHub)](https://github.com/Picsart-AI-Research/MI-GAN) | License: MIT
9999

100100
```sh
101101
vision-cli migan -m MIGAN-512-places2-F16.gguf -i image.png mask.png -o output.png
102102
```
103103

104-
### Real-ESRGAN
104+
#### Real-ESRGAN
105105

106-
<img src="docs/media/example-esrgan.jpg" width="400">
106+
<img width="400" height="256" alt="example-esrgan" src="https://github.com/user-attachments/assets/a41312d6-836c-4b11-ab5d-2e299ffee10c" />
107107

108108
[Model download](https://huggingface.co/Acly/Real-ESRGAN-GGUF) | [Paper (arXiv)](https://arxiv.org/abs/2107.10833) | [Repository (GitHub)](https://github.com/xinntao/Real-ESRGAN) | License: BSD-3-Clause
109109

@@ -170,4 +170,4 @@ uv run pytest
170170

171171
* [ggml](https://github.com/ggml-org/ggml) - ML inference library | MIT
172172
* [stb-image](https://github.com/nothings/stb) - Image load/save/resize | Public Domain
173-
* [fmt](https://github.com/fmtlib/fmt) - String formatting _(only if compiler doesn't support &lt;format&gt;)_ | MIT
173+
* [fmt](https://github.com/fmtlib/fmt) - String formatting _(only if compiler doesn't support &lt;format&gt;)_ | MIT

0 commit comments

Comments
 (0)