hc-tec/my-collection-skills
Uses CookieCloud + API/Playwright to fetch saved items from Bilibili/Zhihu/Xiaohongshu (folders/collections/albums), including audio download + Whisper transcription.
This tool helps social media users collect and organize content they've saved across Bilibili, Zhihu, and Xiaohongshu. It takes your account's saved videos, articles, and notes as input and outputs structured lists, full text content, or video transcripts (including audio-to-text conversion for videos without subtitles). It's designed for anyone who wants to easily review, summarize, or analyze their personal collection of online content.
Use this if you frequently save content on Bilibili, Zhihu, or Xiaohongshu and need a way to extract, organize, and review that content, especially if you want text transcripts or full article text.
Not ideal if you're looking for a public content scraping tool or if you're uncomfortable with managing your own login cookies.
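The subtitle-or-transcription behavior described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the function names `get_transcript` and `whisper_transcribe` are hypothetical, and the Whisper helper assumes the `openai-whisper` package is installed with a model size of `base`.

```python
from typing import Callable, Optional


def get_transcript(
    subtitles: Optional[str],
    audio_path: str,
    transcribe: Callable[[str], str],
) -> str:
    """Return existing subtitles when present; otherwise transcribe the audio.

    `transcribe` is any callable mapping an audio file path to text, so the
    fallback can be swapped or stubbed out without loading a model.
    """
    if subtitles and subtitles.strip():
        return subtitles.strip()
    return transcribe(audio_path).strip()


def whisper_transcribe(audio_path: str, model_name: str = "base") -> str:
    """Hypothetical helper: audio-to-text via the openai-whisper package."""
    import whisper  # imported lazily so the module loads without the package

    model = whisper.load_model(model_name)
    return model.transcribe(audio_path)["text"]
```

Passing the transcriber in as an argument keeps the fallback decision separate from the (slow) model call; in real use one might call `get_transcript(subs, "video.m4a", whisper_transcribe)`.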
Stars: 28
Forks: 5
Language: Python
License: MIT
Category:
Last pushed: Feb 09, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/hc-tec/my-collection-skills"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
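For scripted use, the same request as the curl command above could look roughly like this in Python. It is a sketch under assumptions: the helper names `build_url` and `fetch_repo_data` are made up for illustration, and the response is assumed to be JSON, which the page does not state.

```python
import json
import urllib.request

# Base path taken from the curl example above.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality/perception"


def build_url(repo: str) -> str:
    """Build the endpoint URL for an "owner/name" repository slug."""
    return f"{API_BASE}/{repo.strip('/')}"


def fetch_repo_data(repo: str, timeout: float = 10.0):
    """Fetch the record for `repo`; assumes the body is JSON (unverified)."""
    request = urllib.request.Request(
        build_url(repo), headers={"Accept": "application/json"}
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_repo_data("hc-tec/my-collection-skills")` performs one request, which counts against the daily limit mentioned above.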
Higher-rated alternatives
seleniumbase/SeleniumBase
APIs for browser automation, testing, and bypassing bot-detection.
apify/crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers....
intoli/user-agents
A JavaScript library for generating random user agents with data that's updated daily.
apify/crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In...
Kaliiiiiiiiii-Vinyzu/patchright
Undetected version of the Playwright testing and automation library.