Show HN: Chonkie Cloud – No-nonsense chunking now on the the cloud
(cloud.chonkie.ai)
6 points
by: snyy
5 days ago
☆
(cloud.chonkie.ai)
6 points
by: snyy
5 days ago
☆
Hey HN,
We’re the developers of Chonkie, a powerful and easy-to-use chunking library. Last November, we introduced Chonkie OSS here on HN (https://news.ycombinator.com/item?id=42100819) and got some fantastic feedback. Today, we’re excited to share Chonkie Cloud, a fully hosted service for chunking, along with new tools to make building and debugging RAG systems easier than ever.
---
A quick recap for context:
We launched Chonkie as an open-source project late last year. A few weeks ago, we decided to go full-time on it. Unfortunately, this shift wasn’t as smooth as we had hoped. Due to some legal stuff, we had to rebuild the entire project from scratch in a new repo.
Restarting sucked but it gave us the chance to clean things up and build something faster, cleaner, and better. You can check out the new repo here: https://github.com/chonkie-inc/chonkie
---
What we're launching today:
- Chonkie Cloud – A fully hosted chunking service. Just send in your texts and get chunks back. It supports all our chunkers and comes with a playground to help visualize RAG. Try it out: https://chonkie.cloud (no credit card required)
- Chonkie Recipes – Predefined chunking configs to help you quickly get started. If you're not sure which settings to use, just find the closest match in our recipes. These work with the Recursive and Late chunkers. Available here: https://huggingface.co/datasets/chonkie-ai/recipes
- Chonkie Vizard – A tool to visualize chunking output during development. If you've ever debugged with print statements full of dashes and stars, this is for you.
Installation: pip install "chonkie[viz]"
Example usage (1) : viz.print(chunks) # Print in terminal
Example usage (2): viz.save("out.html", chunks) # Save for later
---
What's next:
- A code chunker
- File “chefs” to help clean and prepare data
- Integrations with vector DBs and data extraction tools
- Support for more languages (typescript is #1 on our list)
---
We’d love feedback on what we’re building. If you have feature requests or ideas, please open an issue on the repo (https://github.com/chonkie-inc/chonkie) or comment here.
Thank you!
4 comments
☆
chandan_hegde
4 days ago
Great work guys! My team at work is experimenting with analyzing huge amounts of subjective data at scale with LLMs and it’s really putting a strain on our budgets. I’ll definitely talk to my boss about importing these tools into our systems, I can definitely see this saving our costs.
☆
skeptrune
5 days ago
prev
next
[ - ]
This is fantastic! I especially apprecciate the vizard tool. Anything that makes it easier to literally "see" what's going as you try to the split the data is an amazing upgrade in my book.
☆
snyy
5 days ago
parent
next
[ - ]
[ x ]
Glad it helps! :)
☆
chiccomagnus
3 days ago
prev
[ - ]
How does credits works?
next
[ - ]