XCrawlGet started in 30 seconds.No credit card required. Explore everything for freeStart Free Trial

Arxiv Paper Scraper API

XCrawl's Arxiv Paper Scraper API revolutionizes paper scraping from arXiv.org, delivering structured JSON data on titles, authors, abstracts, and PDFs. Overcome parsing complexities, rate limits, and IP blocking with our robust, scalable paper scraping solution designed for backend developers.

Start free trial
Contact Sales

What Can You Build With Arxiv Paper Scraper API Scraper?

Build massive research datasets for AI training via efficient paper scraping. Conduct citation analysis and author network mapping using detailed metadata extraction. Automate literature reviews and trend monitoring by scraping Arxiv search results and category feeds for real-time insights.

XCrawl

Structured JSON Output

Receive clean, parseable JSON with paper metadata, authors, abstracts, and links—ideal for Python scripts and database ingestion without manual parsing.

XCrawl

Scalable Bulk Scraping

Process thousands of Arxiv papers asynchronously, supporting high-volume paper scraping for machine learning datasets and academic research pipelines.

XCrawl

Real-time Paper Updates

Capture newly published papers instantly via API endpoints, enabling dynamic paper scraping for trend analysis and alert systems.

XCrawl

PDF Extraction Ready

Direct PDF URLs and optional text extraction in JSON, streamlining paper scraping workflows for full-text analysis and archiving.

Trusted by Data-Driven Teams Worldwide

Used by teams across analytics, research, monitoring, and growth workflows.

XCrawl

Available Arxiv Paper Scraper API Scrapers

Access the most commonly used Arxiv Paper Scraper API data types — fully structured, consistently formatted, and production-ready.

Arxiv Paper Scraper

Extract full metadata for individual or bulk papers by ID or query.

Scraping method:
  • paper_id
  • title
  • authors
  • abstract
  • summary
  • categories
  • pdf_url
  • published_date

Arxiv Search Scraper

Scrape paper results from keyword searches with relevance ranking.

Scraping method:
  • query
  • paper_id
  • title
  • authors
  • score
  • date
  • abstract_snippet

Arxiv Category Scraper

Fetch latest papers from specific Arxiv categories and lists.

Scraping method:
  • category
  • paper_id
  • title
  • authors
  • submission_date
  • pdf_url
  • subjects

Arxiv Author Scraper

Profile authors and their paper histories with bios and metrics.

Scraping method:
  • author_name
  • affiliation
  • paper_count
  • papers
  • h_index
  • citations

Arxiv PDF Scraper

Download PDF links and extract media content from papers.

Scraping method:
  • paper_id
  • pdf_url
  • images
  • figures
  • file_size
  • content_text

Arxiv Citation Scraper

Gather engagement metrics like citations and references.

Scraping method:
  • paper_id
  • citing_papers
  • citation_count
  • references
  • impact_score

Arxiv Paper Scraper API crawling methods

XCrawl

API Scraping (For Developers)

Seamlessly integrate our REST API for programmatic paper scraping in your backend workflows.

  • XCrawl
    Python SDK
    Async requests with our Python library for efficient, high-throughput paper scraping.
  • XCrawl
    Node.js Compatible
    Lightweight Node.js wrappers for real-time JSON responses and webhooks.
  • XCrawl
    Custom Endpoints
    Tailored API calls for bulk queries and structured dataset exports.
XCrawl

No-Code Scraping (For Ops & Growth Teams)

Leverage our dashboard for no-code paper scraping without writing a single line.

  • XCrawl
    Visual Selector
    Point-and-click to build queries for Arxiv papers and categories.
  • XCrawl
    Scheduled Runs
    Automate daily scrapes with cron-like scheduling and notifications.
  • XCrawl
    Export Options
    Download results as CSV, Excel, or JSON for instant analysis.

Code examples

Retrieve Arxiv Paper Scraper API posts and author information in seconds with a simple API call.

Input
Shell
curl -X POST https://xcrawl.com -H "Authorization: YOU_TOKEN" -H "Content-Type: application/json" -d "{\"geo\":\"US\",\"context\":{\"keyword_list\":[{\"keyword\":\"Apple\"}],\"start_page\":1,\"pages\":1},\"source\":\"amazon_search\"}"
Output
Json
{
"result":[
{
"content":{
"url":"https://www.amazon.com/s?k=Apple&page=1"
"page":1
"query":"Apple"
"results":{
"organic":[
{
"pos":1
"url":"https://www.amazon.com/sspa/click?ie=UTF8&spc=MTo1NTU4MDIyNzE4MTQ0NDk1OjE3NjM0NDg1NjM6c3BfYXRmOjMwMDg0MTIyMDE1MTYwMjo6MDo6&url=%2FApple-11-inch-Intelligence-Display-All-Day%2Fdp%2FB0DZ73HCJZ%2Fref%3Dsr_1_1_sspa%3Fdib%3DeyJ2IjoiMSJ9.34Y5eLJt-Syg--Dpi7ueLQwL3ml5AvPfvC0eh7LK2pKhXumC_HQT9LBvkLBiFSrOLyabiwA1DN0qC4nDUFqkGrn5VUhsdLQFYgZ3L8DIPuzIgdPdKtqxJq8diyjiiuXTCDm8kcQmj2lflrdB1g_13fvuEjweGI5mAVZVfJ83S_reyt11VBul7Fga7znbDIGVuFDGhy2lICifAICisiNT88x1w5OOasbBiPs42bcbX0Y.sYUV92XFy8V256YhUSF1FPnMdd_kkjo8lMeGBX4Y2Rs%26dib_tag%3Dse%26keywords%3DApple%26qid%3D1763448563%26sr%3D8-1-spons%26sp_csd%3Dd2lkZ2V0TmFtZT1zcF9hdGY%26psc%3D1"
"asin":"B0DZ73HCJZ"
"price":499.99
"title":"SponsoredSponsored You’re seeing this ad based on the product’s relevance to your search query.Leave ad feedback AppleiPad Air 11-inch with M3 chip Built for Apple Intelligence, Liquid Retina Display, 128GB, 12MP Front/Back Camera, Wi-Fi 6E, Touch ID, All-Day Battery Life — Purple"
"rating":4.8
"currency":"USD"
"is_prime":false
"url_image":"https://m.media-amazon.com/images/I/71b-vc2xzlL._AC_UY218_.jpg"
"best_seller":false
"price_upper":499.99
"is_sponsored":false
"sales_volume":"1K+ bought in past month"
"pricing_count":1
"reviews_count":null
"is_amazons_choice":false
"price_strikethrough":599
"shipping_information":"FREE delivery Sun, Nov 23Or fastest delivery Tomorrow, Nov 19"
},
{
"pos":2
"url":"https://www.amazon.com/sspa/click?ie=UTF8&spc=MTo1NTU4MDIyNzE4MTQ0NDk1OjE3NjM0NDg1NjM6c3BfYXRmOjMwMDg0MTI5NzA2MjkwMjo6MDo6&url=%2FApple-Bluetooth-Headphones-Personalized-Effortless%2Fdp%2FB0DGHMNQ5Z%2Fref%3Dsr_1_2_sspa%3Fdib%3DeyJ2IjoiMSJ9.34Y5eLJt-Syg--Dpi7ueLQwL3ml5AvPfvC0eh7LK2pKhXumC_HQT9LBvkLBiFSrOLyabiwA1DN0qC4nDUFqkGrn5VUhsdLQFYgZ3L8DIPuzIgdPdKtqxJq8diyjiiuXTCDm8kcQmj2lflrdB1g_13fvuEjweGI5mAVZVfJ83S_reyt11VBul7Fga7znbDIGVuFDGhy2lICifAICisiNT88x1w5OOasbBiPs42bcbX0Y.sYUV92XFy8V256YhUSF1FPnMdd_kkjo8lMeGBX4Y2Rs%26dib_tag%3Dse%26keywords%3DApple%26qid%3D1763448563%26sr%3D8-2-spons%26sp_csd%3Dd2lkZ2V0TmFtZT1zcF9hdGY%26psc%3D1"
"asin":"B0DGHMNQ5Z"
"price":117
"title":"SponsoredSponsored You’re seeing this ad based on the product’s relevance to your search query.Leave ad feedback AppleAirPods 4 Wireless Earbuds, Bluetooth Headphones, Personalized Spatial Audio, Sweat and Water Resistant, USB-C Charging Case, H2 Chip, Up to 30 Hours of Battery Life, Effortless Setup for iPhone"
"rating":4.5
"currency":"USD"
"is_prime":false
"url_image":"https://m.media-amazon.com/images/I/61iBtxCUabL._AC_UY218_.jpg"
"best_seller":false
"price_upper":117
"is_sponsored":false
"sales_volume":"10K+ bought in past month"
"pricing_count":1
"reviews_count":null
"is_amazons_choice":false
"price_strikethrough":129
"shipping_information":"FREE delivery Sun, Nov 23Or fastest delivery Tomorrow, Nov 19"
},
{
"pos":3
"url":"https://www.amazon.com/Apple-MX542LL-A-AirTag-Pack/dp/B0D54JZTHY/ref=sr_1_3?dib=eyJ2IjoiMSJ9.34Y5eLJt-Syg--Dpi7ueLQwL3ml5AvPfvC0eh7LK2pKhXumC_HQT9LBvkLBiFSrOLyabiwA1DN0qC4nDUFqkGrn5VUhsdLQFYgZ3L8DIPuzIgdPdKtqxJq8diyjiiuXTCDm8kcQmj2lflrdB1g_13fvuEjweGI5mAVZVfJ83S_reyt11VBul7Fga7znbDIGVuFDGhy2lICifAICisiNT88x1w5OOasbBiPs42bcbX0Y.sYUV92XFy8V256YhUSF1FPnMdd_kkjo8lMeGBX4Y2Rs&dib_tag=se&keywords=Apple&qid=1763448563&sr=8-3"
"asin":"B0D54JZTHY"
"price":79.98
"title":"AppleAirTag 4 Pack. Keep Track of and find Your Keys, Wallet, Luggage, Backpack, and More. Simple one-tap Set up with iPhone or iPad"
"rating":4.7
"currency":"USD"
"is_prime":false
"url_image":"https://m.media-amazon.com/images/I/61bMNCeAUAL._AC_UY218_.jpg"
"best_seller":false
"price_upper":79.98
"is_sponsored":false
"sales_volume":"10K+ bought in past month"
"pricing_count":1
"reviews_count":null
"is_amazons_choice":false
"price_strikethrough":99
"shipping_information":"FREE delivery Sun, Nov 23Or fastest delivery Tomorrow, Nov 19"
},
{
"pos":4
"url":"https://www.amazon.com/Apple-MX532LL-A-AirTag/dp/B0CWXNS552/ref=sr_1_4?dib=eyJ2IjoiMSJ9.34Y5eLJt-Syg--Dpi7ueLQwL3ml5AvPfvC0eh7LK2pKhXumC_HQT9LBvkLBiFSrOLyabiwA1DN0qC4nDUFqkGrn5VUhsdLQFYgZ3L8DIPuzIgdPdKtqxJq8diyjiiuXTCDm8kcQmj2lflrdB1g_13fvuEjweGI5mAVZVfJ83S_reyt11VBul7Fga7znbDIGVuFDGhy2lICifAICisiNT88x1w5OOasbBiPs42bcbX0Y.sYUV92XFy8V256YhUSF1FPnMdd_kkjo8lMeGBX4Y2Rs&dib_tag=se&keywords=Apple&qid=1763448563&sr=8-4"
"asin":"B0CWXNS552"
"price":17.97
"title":"AppleAirTag. Keep Track of and find Your Keys, Wallet, Luggage, Backpack, and More. Simple one-tap Set up with iPhone or iPad"
"rating":4.7
"currency":"USD"
"is_prime":false
"url_image":"https://m.media-amazon.com/images/I/71rP7f78eFL._AC_UY218_.jpg"
"best_seller":false
"price_upper":17.97
"is_sponsored":false
"sales_volume":"10K+ bought in past month"
"pricing_count":1
"reviews_count":null
"is_amazons_choice":false
"price_strikethrough":29
"shipping_information":"FREE delivery Sun, Nov 23 on $35 of items shipped by AmazonOr fastest delivery Tomorrow, Nov 19"
},
{
"pos":5
"url":"https://www.amazon.com/Apple-iPad-Pro-13-inch-M5/dp/B0FWCXMR3W/ref=sr_1_5?dib=eyJ2IjoiMSJ9.34Y5eLJt-Syg--Dpi7ueLQwL3ml5AvPfvC0eh7LK2pKhXumC_HQT9LBvkLBiFSrOLyabiwA1DN0qC4nDUFqkGrn5VUhsdLQFYgZ3L8DIPuzIgdPdKtqxJq8diyjiiuXTCDm8kcQmj2lflrdB1g_13fvuEjweGI5mAVZVfJ83S_reyt11VBul7Fga7znbDIGVuFDGhy2lICifAICisiNT88x1w5OOasbBiPs42bcbX0Y.sYUV92XFy8V256YhUSF1FPnMdd_kkjo8lMeGBX4Y2Rs&dib_tag=se&keywords=Apple&qid=1763448563&sr=8-5"
"asin":"B0FWCXMR3W"
"price":2499
"title":"AppleiPad Pro 13-inch (M5): Ultra Retina XDR Display, 2TB, 12MP Front/Back Camera, LiDAR Scanner, Wi-Fi 7 with Apple N1 + 5G Cellular with C1X chip, Face ID, All-Day Battery Life — Space Black"
"rating":4.6
"currency":"USD"
"is_prime":false
"url_image":"https://m.media-amazon.com/images/I/715V3wbnD6L._AC_UY218_.jpg"
"best_seller":false
"price_upper":2499
"is_sponsored":false
"sales_volume":null
"pricing_count":1
"reviews_count":16
"is_amazons_choice":false
"price_strikethrough":""
"shipping_information":"FREE delivery Sun, Nov 23Or fastest delivery Thu, Nov 20"
},
{
"pos":6
"url":"https://www.amazon.com/Apple-Cancellation-Translation-Headphones-High-Fidelity/dp/B0FQFB8FMG/ref=sr_1_6?dib=eyJ2IjoiMSJ9.34Y5eLJt-Syg--Dpi7ueLQwL3ml5AvPfvC0eh7LK2pKhXumC_HQT9LBvkLBiFSrOLyabiwA1DN0qC4nDUFqkGrn5VUhsdLQFYgZ3L8DIPuzIgdPdKtqxJq8diyjiiuXTCDm8kcQmj2lflrdB1g_13fvuEjweGI5mAVZVfJ83S_reyt11VBul7Fga7znbDIGVuFDGhy2lICifAICisiNT88x1w5OOasbBiPs42bcbX0Y.sYUV92XFy8V256YhUSF1FPnMdd_kkjo8lMeGBX4Y2Rs&dib_tag=se&keywords=Apple&qid=1763448563&sr=8-6"
"asin":"B0FQFB8FMG"
"price":249
"title":"AppleAirPods Pro 3 Wireless Earbuds, Active Noise Cancellation, Live Translation, Heart Rate Sensing, Hearing Aid Feature, Bluetooth Headphones, Spatial Audio, High-Fidelity Sound, USB-C Charging"
"rating":4.4
"currency":"USD"
"is_prime":false
"url_image":"https://m.media-amazon.com/images/I/61solmQSSlL._AC_UY218_.jpg"
"best_seller":false
"price_upper":249
"is_sponsored":false
"sales_volume":"10K+ bought in past month"
"pricing_count":1
"reviews_count":null
"is_amazons_choice":false
"price_strikethrough":""
"shipping_information":"FREE delivery Sun, Nov 23Or fastest delivery Tomorrow, Nov 19"
},
{
"pos":7
"url":"https://www.amazon.com/Apple-2025-MacBook-13-inch-Laptop/dp/B0DZD9S5GC/ref=sr_1_7?dib=eyJ2IjoiMSJ9.34Y5eLJt-Syg--Dpi7ueLQwL3ml5AvPfvC0eh7LK2pKhXumC_HQT9LBvkLBiFSrOLyabiwA1DN0qC4nDUFqkGrn5VUhsdLQFYgZ3L8DIPuzIgdPdKtqxJq8diyjiiuXTCDm8kcQmj2lflrdB1g_13fvuEjweGI5mAVZVfJ83S_reyt11VBul7Fga7znbDIGVuFDGhy2lICifAICisiNT88x1w5OOasbBiPs42bcbX0Y.sYUV92XFy8V256YhUSF1FPnMdd_kkjo8lMeGBX4Y2Rs&dib_tag=se&keywords=Apple&qid=1763448563&sr=8-7"
"asin":"B0DZD9S5GC"
"price":749.99
"title":"Apple2025 MacBook Air 13-inch Laptop with M4 chip: Built for Apple Intelligence, 13.6-inch Liquid Retina Display, 16GB Unified Memory, 256GB SSD Storage, 12MP Center Stage Camera, Touch ID; Midnight"
"rating":4.8
"currency":"USD"
"is_prime":false
"url_image":"https://m.media-amazon.com/images/I/71cWZUr9SVL._AC_UY218_.jpg"
"best_seller":false
"price_upper":749.99
"is_sponsored":false
"sales_volume":null
"pricing_count":1
"reviews_count":null
"is_amazons_choice":false
"price_strikethrough":999
"shipping_information":"FREE delivery Sun, Nov 23Or fastest delivery Tomorrow, Nov 19"
},
{
"pos":8
"url":"https://www.amazon.com/Apple-Headphones-Cancellation-Transparency-Personalized/dp/B0DGJ7HYG1/ref=sr_1_8?dib=eyJ2IjoiMSJ9.34Y5eLJt-Syg--Dpi7ueLQwL3ml5AvPfvC0eh7LK2pKhXumC_HQT9LBvkLBiFSrOLyabiwA1DN0qC4nDUFqkGrn5VUhsdLQFYgZ3L8DIPuzIgdPdKtqxJq8diyjiiuXTCDm8kcQmj2lflrdB1g_13fvuEjweGI5mAVZVfJ83S_reyt11VBul7Fga7znbDIGVuFDGhy2lICifAICisiNT88x1w5OOasbBiPs42bcbX0Y.sYUV92XFy8V256YhUSF1FPnMdd_kkjo8lMeGBX4Y2Rs&dib_tag=se&keywords=Apple&qid=1763448563&sr=8-8"
"asin":"B0DGJ7HYG1"
"price":148.99
"title":"AppleAirPods 4 Wireless Earbuds, Bluetooth Headphones, with Active Noise Cancellation, Adaptive Audio, Transparency Mode, Personalized Spatial Audio, USB-C Charging Case, Wireless Charging, H2 Chip"
"rating":4.5
"currency":"USD"
"is_prime":false
"url_image":"https://m.media-amazon.com/images/I/61iBtxCUabL._AC_UY218_.jpg"
"best_seller":false
"price_upper":148.99
"is_sponsored":false
"sales_volume":"10K+ bought in past month"
"pricing_count":1
"reviews_count":null
"is_amazons_choice":false
"price_strikethrough":179
"shipping_information":"FREE delivery Sun, Nov 23Or fastest delivery Tomorrow, Nov 19"
},
],
"amazons_choices":[
],
},
},
},
],
},

How the Arxiv Paper Scraper API Scraper API works?

  • XCrawlIntelligent IP rotation
  • XCrawlAutomatic CAPTCHA recognition
  • XCrawlHTTP headers
  • XCrawlAutomatic webpage parsing
  • XCrawlCustomizable support

What can our API do for you?

XCrawl

Proxy management

ML-driven proxy selection and rotation using our premium proxy pool from 190 countries.

XCrawl

AI-driven fingerprinting

Unique HTTP headers, JavaScript, and browser fingerprints ensure resilience to dynamic content.

XCrawl

CAPTCHA bypass

Automatic retries and CAPTCHA bypassing for uninterrupted data retrieval.

XCrawl

Bulk data extraction

Extract data from several pages at the same time with up to 10K URLs per batch.

XCrawl

Multiple delivery options

Receive data via cloud storage such as SFTP or AWSS3, or retrieve results through APIs.

XCrawl

Scheduled scraping

Set your preferred frequency for automated, custom-timed data collection, with results delivered directly to your cloud storage.

XCrawl

Maintenance-free infrastructure

Eliminate proxy maintenance and infrastructure hassle. No need to build crawler systems.

XCrawl

Highly scalable

Easy to integrate with support for customization.

XCrawl

24/7 support

Receive professional support in case of anyquestions or issues.

XCrawl Transparent

Flexible Pricing

Transparent web scraping pricing with flexible API subscription plans. Compare data extraction costs, purchase crawler access, and start free — then scale as you grow.

Monthly
Yearly Hot

Scale Plans

High-volume plans for teams that need more power and dedicated support.

Enjoy higher rate limits, more concurrent browsers, and priority support.

Contact Sales
We Provide Enterprise-Level Customization

Explore more solutions

N
NHTSA Vehicle Recalls Intelligence Scraper API

XCrawl's NHTSA Vehicle Recalls Intelligence Scraper API revolutionizes vehicles scraping by providing instant access to comprehensive recall data. Our robust vehicle scraper bypasses rate limits, IP blocks, and parsing complexities, delivering clean JSON for backend developers building safety intelligence tools.

Learn More
b
bitcoin-price-predictor Scraper API

XCrawl's bitcoin-price-predictor Scraper API is the premier price scraper API for extracting real-time bitcoin prices, predictions, and historical data. Bypass parsing headaches and IP blocks with our robust price scraping service, delivering clean JSON for seamless integration into your price monitoring or trading apps.

Learn More
H
Hellowork Jobs Search Scraper API

XCrawl's Hellowork Jobs Search Scraper API is the ultimate job web scraper for extracting real-time job listings from Hellowork. Bypass complex parsing hurdles in scraping job sites with our robust job scraper API, delivering clean JSON data for job scraping tools, job search API integrations, and seamless data extraction jobs without IP blocks or CAPTCHAs.

Learn More
Y
YouTube Subtitles Scraper API

Harness the power of our YouTube Subtitles Scraper API to effortlessly extract captions, transcripts, and metadata from YouTube videos. Bypass rate limits, parsing headaches, and IP blocks with this robust youtube scraper api, delivering clean JSON data for youtube data scraping, search results, and more—ideal for developers building youtube scraping python tools.

Learn More
Y
Youtube Email Scraper - Advanced, Fast And Cheapest Scraper API

XCrawl's Youtube Email Scraper API is the advanced, fast, and cheapest youtube scraper API for extracting emails from YouTube channels, video descriptions, comments, and search results. Bypass rate limits and IP blocks with rotating proxies, parse complex data effortlessly, and receive clean JSON outputs via our youtube scraping api for seamless integration in your email scraping workflows.

Learn More
G
Google Maps Places Scraper API

XCrawl's Google Maps Places Scraper API empowers developers to scrape google maps data effortlessly, bypassing IP blocks and complex parsing challenges. Extract business listings, reviews, and location details via a reliable google maps scraper API, delivering clean JSON from google places api endpoints without the hassle of proxies or CAPTCHAs.

Learn More

What do our customers say?

★★★★★
5.0

The Arxiv paper scraping API transformed our dataset building—fast, accurate, and easy Python integration.

Dr. Elena Vasquez
Dr. Elena Vasquez
AI Research Lead
★★★★★
4.9

Perfect for bulk paper scraping; JSON output is pristine for citation analysis projects.

Prof. Mark Chen
Prof. Mark Chen
Data Scientist
★★★★★
5.0

Saved weeks on literature scraping—reliable PDF links and metadata every time.

Sarah Lin
Sarah Lin
ML Engineer
★★★★★
4.8

Outstanding paper scraping speed and dataset quality for our trend monitoring tool.

Dr. Raj Patel
Dr. Raj Patel
Academic Analyst
★★★★★
5.0

Seamless API integration for Arxiv search scraping—handles scale effortlessly.

Lisa Wong
Lisa Wong
Backend Developer
★★★★★
4.9

No-code dashboard makes paper scraping accessible; exports are spot-on.

Tom Rivera
Tom Rivera
Research Ops
★★★★★
5.0

Accurate author and category data via this paper scraping powerhouse.

Anna Kowalski
Anna Kowalski
PhD Candidate
★★★★★
4.7

Ideal for real-time paper scraping into our training pipelines.

David Kim
David Kim
NLP Specialist
★★★★★
5.0

Robust against blocks; best tool for scalable Arxiv datasets.

Maria Gonzalez
Maria Gonzalez
DevOps Engineer
★★★★★
4.9

Citation scraping is precise—game-changer for impact analysis.

James O'Brien
James O'Brien
Quant Researcher
★★★★★
5.0

The Arxiv paper scraping API transformed our dataset building—fast, accurate, and easy Python integration.

Dr. Elena Vasquez
Dr. Elena Vasquez
AI Research Lead
★★★★★
4.9

Perfect for bulk paper scraping; JSON output is pristine for citation analysis projects.

Prof. Mark Chen
Prof. Mark Chen
Data Scientist
★★★★★
5.0

Saved weeks on literature scraping—reliable PDF links and metadata every time.

Sarah Lin
Sarah Lin
ML Engineer
★★★★★
4.8

Outstanding paper scraping speed and dataset quality for our trend monitoring tool.

Dr. Raj Patel
Dr. Raj Patel
Academic Analyst
★★★★★
5.0

Seamless API integration for Arxiv search scraping—handles scale effortlessly.

Lisa Wong
Lisa Wong
Backend Developer
★★★★★
4.9

No-code dashboard makes paper scraping accessible; exports are spot-on.

Tom Rivera
Tom Rivera
Research Ops
★★★★★
5.0

Accurate author and category data via this paper scraping powerhouse.

Anna Kowalski
Anna Kowalski
PhD Candidate
★★★★★
4.7

Ideal for real-time paper scraping into our training pipelines.

David Kim
David Kim
NLP Specialist
★★★★★
5.0

Robust against blocks; best tool for scalable Arxiv datasets.

Maria Gonzalez
Maria Gonzalez
DevOps Engineer
★★★★★
4.9

Citation scraping is precise—game-changer for impact analysis.

James O'Brien
James O'Brien
Quant Researcher
★★★★★
5.0

The Arxiv paper scraping API transformed our dataset building—fast, accurate, and easy Python integration.

Dr. Elena Vasquez
Dr. Elena Vasquez
AI Research Lead
★★★★★
4.9

Perfect for bulk paper scraping; JSON output is pristine for citation analysis projects.

Prof. Mark Chen
Prof. Mark Chen
Data Scientist
★★★★★
5.0

Saved weeks on literature scraping—reliable PDF links and metadata every time.

Sarah Lin
Sarah Lin
ML Engineer
★★★★★
4.8

Outstanding paper scraping speed and dataset quality for our trend monitoring tool.

Dr. Raj Patel
Dr. Raj Patel
Academic Analyst
★★★★★
5.0

Seamless API integration for Arxiv search scraping—handles scale effortlessly.

Lisa Wong
Lisa Wong
Backend Developer
★★★★★
4.9

No-code dashboard makes paper scraping accessible; exports are spot-on.

Tom Rivera
Tom Rivera
Research Ops
★★★★★
5.0

Accurate author and category data via this paper scraping powerhouse.

Anna Kowalski
Anna Kowalski
PhD Candidate
★★★★★
4.7

Ideal for real-time paper scraping into our training pipelines.

David Kim
David Kim
NLP Specialist
★★★★★
5.0

Robust against blocks; best tool for scalable Arxiv datasets.

Maria Gonzalez
Maria Gonzalez
DevOps Engineer
★★★★★
4.9

Citation scraping is precise—game-changer for impact analysis.

James O'Brien
James O'Brien
Quant Researcher
★★★★★
5.0

The Arxiv paper scraping API transformed our dataset building—fast, accurate, and easy Python integration.

Dr. Elena Vasquez
Dr. Elena Vasquez
AI Research Lead
★★★★★
4.9

Perfect for bulk paper scraping; JSON output is pristine for citation analysis projects.

Prof. Mark Chen
Prof. Mark Chen
Data Scientist
★★★★★
5.0

Saved weeks on literature scraping—reliable PDF links and metadata every time.

Sarah Lin
Sarah Lin
ML Engineer
★★★★★
4.8

Outstanding paper scraping speed and dataset quality for our trend monitoring tool.

Dr. Raj Patel
Dr. Raj Patel
Academic Analyst
★★★★★
5.0

Seamless API integration for Arxiv search scraping—handles scale effortlessly.

Lisa Wong
Lisa Wong
Backend Developer
★★★★★
4.9

No-code dashboard makes paper scraping accessible; exports are spot-on.

Tom Rivera
Tom Rivera
Research Ops
★★★★★
5.0

Accurate author and category data via this paper scraping powerhouse.

Anna Kowalski
Anna Kowalski
PhD Candidate
★★★★★
4.7

Ideal for real-time paper scraping into our training pipelines.

David Kim
David Kim
NLP Specialist
★★★★★
5.0

Robust against blocks; best tool for scalable Arxiv datasets.

Maria Gonzalez
Maria Gonzalez
DevOps Engineer
★★★★★
4.9

Citation scraping is precise—game-changer for impact analysis.

James O'Brien
James O'Brien
Quant Researcher
ISO 27001
XCrawlISO 27001
CDPR
XCrawlCDPR
Top-Rated by Users
XCrawlTop-Rated by Users
Leader
XCrawlLeader
Easiest To Use
XCrawlEasiest To Use
Best Value Award
XCrawlBest Value Award

Frequently asked questions

Everything you need to know about XCrawl.

How does the Arxiv Paper Scraper API architecture work?
Our distributed crawlers fetch public Arxiv pages, parse content with AI-enhanced extractors, and return structured JSON via REST endpoints.
What factors determine the pricing model?
Pricing scales with API calls, data volume scraped, success rate, and optional premium features like PDF text extraction.
What data coverage and limitations apply?
Full coverage of public papers, metadata, and categories; minor delays for embargoed new submissions, no private content.
Is the paper scraping compliant and legal?
We scrape only publicly available data in line with Arxiv's terms of use and robots.txt, ensuring ethical compliance.
What integration support is available?
Comprehensive docs, Python/Node SDKs, code samples, and Slack support for custom paper scraping setups.

Get the data you need.

Let us handle the data collection while you focus on your work.

Start for Free