AutoSearch Channel Research Final Playbook
Date: 2026-04-19 Coverage: mcpmarket.com research + hands-on testing + TikHub paid API integration Positioning: This is the final master guide. The previously scattered docs (mcp-channel-research.md/mcp-test-plan.md/mcp-test-results.md/mcp-no-cookie-inventory.md/mcp-final-summary.md/tikhub-smoke-test.md) are consolidated here. All future channel decisions should start from this document.
TL;DR
Three-tier conclusion:- Free and usable without cookies: Sogou WeChat, open-websearch (4 engines), Paper Search MCP (21 academic sources), PullPush (Reddit history), direct RSS reads, Jina Reader (some sites), and 13 official APIs.
- 5 of 8 hard targets were unblocked through paid TikHub: Xiaohongshu, Weibo, Zhihu, Twitter/X, and Douyin. The remaining 3 (flaky Weibo upstream, Instagram/LinkedIn parameters not fully tuned) were not fully solved in this round.
- Final architecture: AutoSearch uses BYOK (users provide their own TikHub key) and declares dependencies with
requires: [env:TIKHUB_API_KEY]. The base URL is read from an env var so environments can switch cleanly.
Part 1: No-Cookie Usable Inventory (41)
A. Hands-On PASS in This Round (9)
| Option | Platform/Capability | Auth | Notes |
|---|---|---|---|
| Sogou WeChat SERP | WeChat Official Accounts | 🟢 None | weixin.sogou.com/weixin?type=2&query=<q> |
| open-websearch fetch-web + readability | Any page -> clean markdown | 🟢 None | Cleaned with the Mozilla Readability algorithm |
| open-websearch csdn | CSDN technical blogs | 🟢 None | Native Chinese content |
| open-websearch duckduckgo | DuckDuckGo | 🟢 None | Backup general search |
| open-websearch startpage | Startpage (Google proxy) | 🟢 None | Backup general search |
| Paper Search MCP | arXiv + PubMed + Semantic Scholar + OpenAlex + 17 other sources | 🟢 None | One-command access to 21 academic sources, MIT |
| PullPush | Reddit history from 2005 to present | 🟢 None | Full-text q= search is broken; subreddit/time filtering works |
| Substack RSS | Substack / Medium / independent blogs | 🟢 None | All sites expose /feed |
| Jina Reader | Bilibili / weak anti-scraping sites | 🟢 None | Zhihu, Xiaohongshu, and Weibo are blocked |
B. Official APIs Known to Work (13, Integrated in AutoSearch or One-Line Integration)
| Platform | Access Method | Limit |
|---|---|---|
| HackerNews Algolia | Official API | Unlimited |
| Reddit public JSON | /r/<sub>.json | No login |
| Stack Exchange API | Official | 300/day without key |
| GitHub Public Search | Official API | 10/min without token |
| Dev.to | Public API | No key |
| npm / PyPI registry | Official | No key |
| OpenReview API | Official | No key |
| arXiv API | Official | No key |
| Xueqiu / 36kr / InfoQ Chinese | Their web/API surfaces | No key |
| Xiaoyuzhou | RSS / public pages | No key |
| Bilibili public search API | Official | No login |
| YouTube transcript | yt-dlp + whisper (local) | No key |
| Crunchbase public pages | Public search | Limited |
C. New Categories Found on mcpmarket (AutoSearch Gaps) (11)
| Option | Capability | Priority |
|---|---|---|
| Wikipedia MCP | Full encyclopedia + summaries | 🔴 Highest |
| Wikidata | SPARQL + entities | 🔴 Highest |
| Google Trends Explorer | Trend signals | 🔴 Highest |
| Google News Trends | News + trending terms | 🔴 High |
| USPTO | U.S. patents + trademarks | 🟡 As needed |
| Google Patents | Global patents | 🟡 As needed |
| EPSS | CVE + vulnerability scoring | 🟡 As needed |
| Cybersecurity CVE | NVD database | 🟡 As needed |
| OpenStreetMap | Maps + POI | 🟢 As needed |
| CoinGecko | Cryptocurrency | 🟢 As needed |
| Open Meteo | Global weather | 🟢 As needed |
D. Existing Channel Reinforcements (8)
| Option | Reinforces | Notes |
|---|---|---|
| Package Version | npm/PyPI/Maven/Go/Swift/Docker Hub and 9 registries total | One MCP covers 9 registries |
| GitLab MCP | Complements GitHub | Public repos need no token |
| Newsfeed | search-rss preset categories | Free |
| Trend Radar | HN + GitHub signal aggregation | Free |
| Steam Context | Games category | Steam API key is free |
| Zillow | Real-estate category | Mostly no key |
| AI Job Hunting Agent | Indeed + Remotive jobs | Free |
| Sogou WeChat MCP wrappers (ptbsare / fancyboi999) | Official Accounts | Same underlying Sogou SERP |
Part 2: Hard-Target Breakthrough Matrix
🔴 Integrated (Paid TikHub, 5/8)
| Platform | TikHub endpoint | Measured Data |
|---|---|---|
| Xiaohongshu | GET /api/v1/xiaohongshu/web/search_notes?keyword=<q> | Claude 4.7 found 20 notes, including title/author/likes |
| Zhihu | GET /api/v1/zhihu/web/fetch_article_search_v3?keyword=<q> | LLM agent found 20 high-quality Q&A results |
| Douyin | GET /api/v1/douyin/web/fetch_video_search_result_v2?keyword=<q> | DeepSeek found 12 results, highest with 540K likes |
| Twitter/X | GET /api/v1/twitter/web/fetch_search_timeline?keyword=<q>&search_type=Top | 19 results, full fields (favorites/views/retweets) |
| Bilibili | GET /api/v1/bilibili/web/fetch_general_search?keyword=<q>&order=totalrank&page=1&page_size=10 | 8 videos + play counts |
⚠️ Not Fully Solved in This Round (3/8)
| Platform | Status | Debug Direction |
|---|---|---|
TikHub upstream is flaky: web/fetch_search returns ok=1 but cards is intermittently empty; web_v2/fetch_realtime_search 400; app/fetch_search_all 422 | Ask support in TikHub Discord; try web_v2/fetch_ai_smart_search | |
v2/general_search returns 200 OK but all zeroes | Try v2/search_hashtags or targeted keywords (#hashtag / @username) | |
web/search_jobs returns 400, even with geocode | Check TikHub docs for demo parameters; may require a special geocode format |
🟢 Truly Impossible Without Cookies
Facebook / Instagram public personal pages / Xiaohongshu private-account content / Twitter private accounts: even TikHub requires account support. These are not core to the research scenario, so do not invest for now.Part 3: TikHub Field Notes
Basic Information
- Homepage: https://tikhub.io · API base:
https://api.tikhub.io/api/v1/ - Auth:
Authorization: Bearer $TIKHUB_API_KEY - OpenAPI spec:
GET https://api.tikhub.io/openapi.json(1058 endpoints) - Billing: pay-per-request, average **0.001; measured cost is 3-4x higher)
- Free quota: Daily check-in gives a small amount of free credit
Platform Coverage (Confirmed Working)
16 platforms, 1000+ tools:Key Pitfalls (In the Order Encountered)
- Registration requires email verification. Without verification, every endpoint returns 403 “email not verified”.
- The free tier covers only some endpoints. Xiaohongshu/Weibo/Twitter/Douyin all return 402 (Payment Required) until paid balance is > 0. Zhihu and Bilibili are covered by the free tier.
- Do not use
web_v3for Xiaohongshu./xiaohongshu/web_v3/fetch_search_notesreturns 400. Use the older/xiaohongshu/web/search_notes. - Do not use
app/v3for Douyin./douyin/app/v3/fetch_video_search_resultreturns 400. Use/douyin/web/fetch_video_search_result_v2; data is underdata.business_data[i].data(two nested layers). - Weibo endpoints are unstable across versions.
web_v2/fetch_realtime_searchreturns 400;app/fetch_search_allreturns 422;web/fetch_searchreturns 200 butcardsfluctuates. - 400 = upstream scraping failure, not necessarily bad parameters. TikHub’s 400 body says “please check docs and parameters”, but in practice TikHub itself failed to scrape the target site. This kind of 400 is not billed (Only pay for successful requests).
- Twitter structure is already flattened.
data.timelineis a list, not the original GraphQL-style nestedinstructionsstructure. Iterate it directly. Fields:screen_name / text / favorites / views / retweets / replies / created_at / tweet_id / lang.
🚨 Security Note: The Key Is Echoed in Error Responses
TikHub 400 / 403 / 422 response bodies echo the full request headers, including auth information. Integration must:MCP vs API Choice
TikHub officially provides 4 transports: Stdio / SSE / Streamable HTTP / Curl(API). For AutoSearch: choose direct Curl/API calls, not MCP. Reasons:- AutoSearch is a Python plugin installed with
pip install. Using MCP would require users to additionally install Node.js +mcp-remote+ the TikHub MCP server (3 dependencies) - AutoSearch channels are already “tools”; calling MCP from inside a tool is tool-calling-a-tool, and JSON-RPC over stdio adds a useless serialization layer
- Enabling the full TikHub MCP exposes 1000+ tool schemas and injects roughly 400K tokens into Claude context, which is untenable
- Even selecting only the 6 hard-target platforms still exposes ~470 tools (~190K tokens), which is still not ideal
Part 4: Final AutoSearch Integration Architecture
BYOK (Bring Your Own Key) Is the Only Reasonable Plan
Do not:- ❌ Hardcode a TikHub key into the release build (a pip package exposes plaintext)
- ❌ Share one key across all users (violates ToS and drains the balance)
- ✅ Users register and top up at
tikhub.iothemselves - ✅
export TIKHUB_API_KEY=<your-key>, and AutoSearch reads the env var - ✅ SKILL.md declares
requires: [env:TIKHUB_API_KEY]; without a key, the channel is automatically marked unavailable while other channels continue running
Channel Structure (Validated by Pilot)
[via_tikhub, api_search, api_answer_detail] — prefer TikHub when a key is present; fall back to the original path when no key exists.
Leave Room for a Future Proxy
tikhub_client.py reads the base URL from an env var:
5 Channels Pending Integration (Priority Order)
| Priority | Channel | Endpoint | Status |
|---|---|---|---|
| 1 | zhihu | zhihu/web/fetch_article_search_v3 | ✅ pilot complete (branch feat/tikhub-channels) |
| 2 | xiaohongshu | xiaohongshu/web/search_notes | Pending expansion |
| 3 | twitter/web/fetch_search_timeline | Pending expansion | |
| 4 | douyin | douyin/web/fetch_video_search_result_v2 | Pending expansion |
| 5 | bilibili | bilibili/web/fetch_general_search | Pending expansion |
Appendix
A. Test Command Quick Reference (Copy-Paste Ready)
B. External Links
- mcpmarket.com — research source for this round
- tikhub.io — paid platform API
- github.com/Aas-ee/open-webSearch — 8-engine free SERP
- github.com/openags/paper-search-mcp — 21 academic sources
- github.com/jacklenzotti/pullpush-mcp — Reddit history
- r.jina.ai — free URL -> markdown via Jina Reader
C. Related Docs (Before This Round)
Consolidated into this playbook, so you no longer need to read them day to day:mcp-channel-research.md— initial researchmcp-channel-test-plan.md— test planmcp-test-results.md— first round of hands-on testsmcp-no-cookie-inventory.md— no-cookie inventorymcp-final-summary.md— phase summarytikhub-smoke-test.md— detailed TikHub smoke test
tikhub-smoke-test.md is enough.
D. Cost Ledger (As of 2026-04-19)
- TikHub: $0.053 (19 smoke requests)
- Other: $0 (all free options)
Maintenance rule: When a new channel is integrated or a new hard target is unblocked, update the corresponding table in this playbook to keep it as the single source of truth.
Appendix E. 2026-04-19 Supplemental Scan (New No-Key + No-Cookie Options)
This deeper mcpmarket.com pass found 12 fully no-key and no-cookie categories missed earlier. Ranked by research value:🔴 High Value (Strongly Recommended)
| MCP | Capability | Auth | Tested | Why It Matters |
|---|---|---|---|---|
| FRED · Fredapi · FRED Macro Data | U.S. Federal Reserve economic data, 800K+ time series | 🟢 No key | Not tested | Essential macro signal for AI/startup research |
| World Bank Data | Global economic indicators | 🟢 No key | Not tested | Core need for country-comparison research |
| Public APIs Directory (repo) | Free API directory semantic search | 🟢 No key | ✅ 04-19 | Metatool — future engine for AutoSearch to discover new channels |
| Open Library / Books · OpenLibrary | ISBN lookup, full book catalog | 🟢 No auth | Not tested | Foundation for academic/publishing research |
| Word of the Day | Free Dictionary API with definitions, pronunciation, and examples | 🟢 No key | Not tested | Basic fact-checking utility |
| Dash Docset · Enhanced Dash | Local Dash docset query | 🟢 Local | Not tested | Instant technical-doc lookup |
| Hugging Face MCP | Hub model/dataset search | 🟢 Public search without key | ✅ 04-19 | Foundation for AI research |
| Wikipedia API (official) | Encyclopedia search + summaries | 🟢 No key | ✅ 04-19 | Foundation for fact checking and encyclopedia enrichment |
| Wikidata SPARQL (official) | Structured entities + graph queries | 🟢 No key | ✅ 04-19 | Entity relationship queries |
Google Trends via pytrends | Keyword trend time series | 🟢 No key | ✅ 04-19 | Topic heat and brand-comparison signals |
🟡 Integrate As Needed
| MCP | Capability | Auth | Scenario |
|---|---|---|---|
| Flightradar24 · Flight (ADS-B) | Real-time flight tracking | 🟢 Partly no key | Travel / supply-chain research |
| Etherscan MCP · Dune Analytics | Ethereum on-chain data | 🟢 Etherscan free key / 🟡 Dune paid | Web3 research |
| Ethereum JSON-RPC | Native JSON-RPC chain query | 🟢 Public nodes | Deep DeFi research |
| TMDB · IMDb · OMDB | Movie/TV data | 🟢 TMDB/OMDB free key | Content/entertainment research |
| Last.fm (ScrobblerContext) | Music data | 🟢 Free key | Content analysis |
| OpenFoodFacts | Global food database | 🟢 No key | Consumer goods / health research |
🟢 Basic Utilities (Occasional Use)
| MCP | Capability | Auth |
|---|---|---|
| Whois Lookup · Domain Lookup | WHOIS / RDAP lookup | 🟢 No key |
| DeepL MCP | Translation | 🟡 Free tier requires key |
| Pexels MCP | Free stock images | 🟡 Free key |
mcpmarket Has No Coverage for These Categories
- Toutiao / Baijiahao / NetEase Hao (Chinese news aggregators)
- Reuters / TechCrunch / Bloomberg (professional news, RSS only)
- Coursera / edX / Khan Academy / MOOC
- Chinese government open data (National Bureau of Statistics, customs)
- ESPN / sports data
- Air quality / environmental data / real-time carbon emissions
site:, or by waiting for future mcpmarket additions.
2026-04-19 Hands-On Verification (5 Core MCPs)
Smoke tests were run against the underlying APIs for the 5 most important MCPs for “basic facts / trends / AI Hub / metatools.” All passed:| MCP | Underlying API | Smoke Test | Sample |
|---|---|---|---|
| Wikipedia | en.wikipedia.org/w/api.php?action=query&list=search&srsearch=<q> | Search Claude AI -> 5 results | Claude (language model) · Anthropic · OpenAI Codex · Artificial intelligence · Project Maven |
| Wikidata | query.wikidata.org/sparql (SPARQL) | P31 wd:Q11660 query for AI entities -> 5 results | AlphaFold · Lee Luda · Intelligent Autonomous Systems |
| Google Trends | pytrends Python library (unofficial, simulated requests) | 7 days hourly Claude AI vs ChatGPT -> 169 rows | Latest hour Claude AI=3, ChatGPT=45, making the brand-awareness gap obvious |
| Hugging Face Hub | huggingface.co/api/models?search=<q> | Search llama -> 5 results | meta-llama/Llama-3.1-8B-Instruct, 9.36M downloads · Llama-3.3-70B-Instruct, 490K downloads |
| Public APIs Directory | Local datastore/index.json (1426 full APIs) | Full load + filter auth=No -> 668 no-key APIs | 1426 APIs across dozens of categories including Animals/Crypto/Finance/Transport/Security |
api.publicapis.org at all. Its data is built-in JSON (1426 entries), and the embedding index is built locally (.npz file). So even though the old API domain now fails DNS resolution, the MCP works correctly.
Pitfall warning — Google Trends: pytrends is an unofficial simulated-request library. Google occasionally changes cookie/token mechanics and breaks it. Integration must include retries + error tolerance and must not treat it as a hard dependency. It is much more fragile than the other 4 official APIs.
Integration recommendation (lightest path): The first 4 can all be called directly through Python httpx against HTTP APIs; there is no need to start an MCP process. Public APIs Directory can have its index.json copied into AutoSearch and parsed locally. A 1426-row JSON file is small, and embeddings can be computed locally. This keeps AutoSearch free of new external process dependencies.
Updated AutoSearch Priority After the Scan
On top of the 6 “strongly recommended” basics in Part 1 section C, add 3 highest-priority items:- FRED + World Bank (macroeconomic signals, no key and no cookie in one step)
- Public APIs Directory (metatool; lets AutoSearch automatically discover new channels)
- Open Library (foundation for academic/publishing research)