Content

GET /v1/content

Extract clean HTML, Markdown, and structured metadata from any public URL. Ideal for AI pipelines, content monitoring, and indexing.

curl "https://api.chuger.com/v1/content?url=https://example.com/blog/article" \
  -H "Authorization: Bearer YOUR_API_TOKEN"

Example response

{
  "url": "https://example.com/blog/article",
  "success": true,
  "statusCode": 200,
  "errorMessage": null,
  "html": "<article>...</article>",
  "markdown": "# Article title\n\nFirst paragraph...",
  "metadata": {
    "title": "Article title",
    "description": "Short summary of the article",
    "keywords": "web, scraping, example",
    "author": "Jane Doe",
    "language": "en",
    "favicon": "https://example.com/favicon.ico",
    "ogImage": "https://example.com/cover.png",
    "ogImageWidth": "1200",
    "ogImageHeight": "630",
    "ogTitle": "Article title",
    "ogType": "article"
  }
}

Turn a web page into clean HTML, Markdown, and structured metadata.

GET https://api.chuger.com/v1/content

If you just need the raw HTML, use /v1/scrape. To process many URLs at once, use /v1/content/bulk.

Authentication

Bearer token in the Authorization header. See Authentication.
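
The request shown earlier can be assembled in Python as a rough sketch, assuming only the standard library. The helper name `build_content_request` is hypothetical, and percent-encoding the `url` value is a general precaution rather than a documented requirement (the curl example passes it unencoded).

```python
from urllib.parse import urlencode
from urllib.request import Request

API_BASE = "https://api.chuger.com/v1/content"


def build_content_request(target_url: str, token: str) -> Request:
    """Build (but do not send) a GET request for /v1/content."""
    # urlencode percent-encodes the target URL so characters such as
    # '?' and '&' inside it are not read as part of the API query string.
    query = urlencode({"url": target_url})
    return Request(
        f"{API_BASE}?{query}",
        headers={"Authorization": f"Bearer {token}"},
        method="GET",
    )


req = build_content_request("https://example.com/blog/article", "YOUR_API_TOKEN")
print(req.full_url)
```

Passing `req` to `urllib.request.urlopen` would send it; that step is left out here so the sketch runs without network access or a real token.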

Cost

Plan      Credits per request
Basic     2
Pro       2
Business  2

Credits are only deducted on success.
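
For budgeting, the flat rate above reduces to simple arithmetic. The sketch below assumes that "success" means the request was billed and that failed requests cost nothing; the function names are illustrative only.

```python
CREDITS_PER_REQUEST = 2  # same for Basic, Pro, and Business per the table above


def credits_used(successful_requests: int) -> int:
    """Credits consumed by a number of successful /v1/content requests."""
    if successful_requests < 0:
        raise ValueError("successful_requests must be non-negative")
    return CREDITS_PER_REQUEST * successful_requests


def max_successful_requests(credit_balance: int) -> int:
    """How many successful requests a credit balance can cover."""
    return credit_balance // CREDITS_PER_REQUEST


print(credits_used(150))              # 300
print(max_successful_requests(1001))  # 500
```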

Query parameters

url (query, string, required)

The URL to extract content from. Must be HTTP or HTTPS, max 180 characters. Raw IP hosts and non-default ports are rejected.
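
These constraints can be checked client-side before spending a request. The following is a sketch, not the service's actual validation logic: `check_target_url` is a hypothetical helper, and treating an explicit default port (`:80` or `:443`) as acceptable is an assumption.

```python
import ipaddress
from typing import Optional
from urllib.parse import urlsplit

MAX_URL_LENGTH = 180
DEFAULT_PORTS = {"http": 80, "https": 443}


def check_target_url(url: str) -> Optional[str]:
    """Return a reason the URL would likely be rejected, or None if it looks fine."""
    if len(url) > MAX_URL_LENGTH:
        return "too long"
    try:
        parts = urlsplit(url)
        port = parts.port  # raises ValueError on a malformed port
    except ValueError:
        return "malformed"
    if parts.scheme not in DEFAULT_PORTS:
        return "scheme must be http or https"
    host = parts.hostname
    if not host:
        return "missing host"
    try:
        ipaddress.ip_address(host)
        return "raw IP host"
    except ValueError:
        pass  # not an IP literal, so treat it as a hostname
    # Assumption: an explicitly written default port is still accepted.
    if port is not None and port != DEFAULT_PORTS[parts.scheme]:
        return "non-default port"
    return None


print(check_target_url("https://example.com/blog/article"))  # None
print(check_target_url("https://example.com:8443/x"))        # non-default port
```

A URL that passes this check can still be rejected by the API, so the 422 response remains the source of truth.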

Example: https://example.com/blog/article

Response fields

url (string, required)

The URL that was extracted.

success (boolean, required)

true when content was successfully extracted.

statusCode (integer, required)

The HTTP status code returned by the target site.

errorMessage (string)

Populated when success is false.

html (string)

Cleaned, readable HTML of the main content. Boilerplate (nav, footer, ads) is removed.

markdown (string)

The same content rendered as Markdown.

metadata (object, required)

Page-level metadata.
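
The fields above map naturally onto a small typed container. This is a minimal sketch: the `ContentResult` class and `parse_content_response` function are illustrative names, and treating `errorMessage`, `html`, and `markdown` as optional follows from their not being marked Required.

```python
import json
from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class ContentResult:
    url: str
    success: bool
    status_code: int
    metadata: dict[str, Any]
    error_message: Optional[str] = None
    html: Optional[str] = None
    markdown: Optional[str] = None


def parse_content_response(body: str) -> ContentResult:
    """Parse a /v1/content JSON body into a ContentResult."""
    data = json.loads(body)
    return ContentResult(
        url=data["url"],                # required fields: fail loudly if absent
        success=data["success"],
        status_code=data["statusCode"],
        metadata=data["metadata"],
        error_message=data.get("errorMessage"),  # optional fields: default to None
        html=data.get("html"),
        markdown=data.get("markdown"),
    )


# A trimmed copy of the sample response, serialized to a JSON string.
sample = json.dumps({
    "url": "https://example.com/blog/article",
    "success": True,
    "statusCode": 200,
    "errorMessage": None,
    "html": "<article>...</article>",
    "markdown": "# Article title\n\nFirst paragraph...",
    "metadata": {"title": "Article title", "language": "en"},
})
result = parse_content_response(sample)
print(result.success, result.status_code, result.metadata["title"])
```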

Errors

Status  When
401     Missing / invalid token
402     No plan, or insufficient credits
422     url missing, malformed, too long, raw IP, or non-default port
429     Rate limit or monthly quota exceeded
503     Content could not be extracted

See Errors for the full reference.
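
One way a client might act on these statuses is sketched below. The retry policy is an assumption, not documented behavior: it treats 429 and 503 as retryable with exponential backoff, although a 429 caused by an exhausted monthly quota will not clear until the quota resets. All names are illustrative.

```python
# Status codes and meanings copied from the table above.
ERROR_MEANINGS = {
    401: "Missing / invalid token",
    402: "No plan, or insufficient credits",
    422: "url missing, malformed, too long, raw IP, or non-default port",
    429: "Rate limit or monthly quota exceeded",
    503: "Content could not be extracted",
}

# Assumption: only these statuses can plausibly succeed on a later attempt.
RETRYABLE_STATUSES = {429, 503}


def describe_error(status: int) -> str:
    """Return the documented meaning of a status, or a generic fallback."""
    return ERROR_MEANINGS.get(status, f"Unexpected status {status}")


def should_retry(status: int) -> bool:
    return status in RETRYABLE_STATUSES


def backoff_delays(attempts: int, base: float = 1.0, cap: float = 30.0) -> list[float]:
    """Exponential backoff delays in seconds: base, 2*base, 4*base, ... up to cap."""
    return [min(cap, base * 2 ** i) for i in range(attempts)]


print(describe_error(422))
print(should_retry(401), should_retry(503))  # False True
print(backoff_delays(4))                     # [1.0, 2.0, 4.0, 8.0]
```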

Tips

markdown is convenient for feeding into LLMs and search indexes. Use the metadata block to populate previews, cards, and search results in your UI without an extra parse step.
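
As an illustration of the metadata tip, the sketch below maps the metadata block onto a preview card. The card's shape and the `build_preview` name are hypothetical; the only facts taken from the sample response are the field names and that image dimensions arrive as strings.

```python
from typing import Any, Optional


def _to_int(value: Any) -> Optional[int]:
    # ogImageWidth and ogImageHeight are strings in the sample response.
    try:
        return int(value)
    except (TypeError, ValueError):
        return None


def build_preview(metadata: dict) -> dict:
    """Build a link-preview card from the metadata block, tolerating missing keys."""
    return {
        "title": metadata.get("ogTitle") or metadata.get("title"),
        "summary": metadata.get("description"),
        "image": metadata.get("ogImage"),
        "image_width": _to_int(metadata.get("ogImageWidth")),
        "image_height": _to_int(metadata.get("ogImageHeight")),
        "icon": metadata.get("favicon"),
    }


card = build_preview({
    "title": "Article title",
    "description": "Short summary of the article",
    "favicon": "https://example.com/favicon.ico",
    "ogImage": "https://example.com/cover.png",
    "ogImageWidth": "1200",
    "ogImageHeight": "630",
    "ogTitle": "Article title",
})
print(card["title"], card["image_width"], card["image_height"])
```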

For batch jobs, /v1/content/bulk is much more efficient than firing many parallel requests here.