Skip to content

Commit

Permalink
init
Browse files Browse the repository at this point in the history
  • Loading branch information
ymy-k committed Jan 31, 2024
1 parent 8ab77fc commit 08b3cb5
Show file tree
Hide file tree
Showing 3 changed files with 3 additions and 5 deletions.
Binary file modified .asset/Hi-SAM.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified .asset/overview.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
8 changes: 3 additions & 5 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,22 +3,20 @@
<p align="center">
<a href="https://arxiv.org"><img src="https://img.shields.io/badge/arXiv-Paper-<color>"></a>
</p>
This is the official repository of the paper: [Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation](https://arxiv.org).

This is the official repository of the paper [Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation](https://arxiv.org).

## :sparkles: Highlight

![overview](.asset/overview.png)

- **Hierarchical Text Segmentation.** Hi-SAM unifies text segmentation across stroke, word, text-line, and paragraph level. Hi-SAM also achieves layout analysis as a by-product.
- **High-Quality Text Stroke Segmentation.** High-quality text stroke segmentation by leveraging 1024×1024 mask feature resolution.
- **Hierarchical Text Segmentation.** Hi-SAM unifies text segmentation across stroke, word, text-line, and paragraph levels. Hi-SAM also achieves layout analysis as a by-product.
- **High-Quality Text Stroke Segmentation.** High-quality text stroke segmentation by introducing mask feature of 1024×1024 resolution with minimal modification in SAM's original mask decoder.
- **Automatic and Interactive.** Hi-SAM supports both automatic mask generation and interactive promptable mode. Given a single-point prompt, Hi-SAM provides word, text-line, and paragraph masks.


## :bulb: Overview of Hi-SAM
![Hi-SAM](.asset/Hi-SAM.png)


## :label: TODO

- [ ] Release demo.
Expand Down

0 comments on commit 08b3cb5

Please sign in to comment.