XCrawlCommencez en 30 secondes.Aucune carte de crédit requise. Découvrez tout gratuitement.Commencer l’essai gratuit

Arxiv Paper Scraper API

XCrawl's Arxiv Paper Scraper API revolutionizes paper scraping from arXiv.org, delivering structured JSON data on titles, authors, abstracts, and PDFs. Overcome parsing complexities, rate limits, and IP blocking with our robust, scalable paper scraping solution designed for backend developers.

Démarrer l'essai gratuit
Contacter le service commercial

Que pouvez-vous construire avec le scraper Arxiv Paper Scraper API ?

Build massive research datasets for AI training via efficient paper scraping. Conduct citation analysis and author network mapping using detailed metadata extraction. Automate literature reviews and trend monitoring by scraping Arxiv search results and category feeds for real-time insights.

XCrawl

Structured JSON Output

Receive clean, parseable JSON with paper metadata, authors, abstracts, and links—ideal for Python scripts and database ingestion without manual parsing.

XCrawl

Scalable Bulk Scraping

Process thousands of Arxiv papers asynchronously, supporting high-volume paper scraping for machine learning datasets and academic research pipelines.

XCrawl

Real-time Paper Updates

Capture newly published papers instantly via API endpoints, enabling dynamic paper scraping for trend analysis and alert systems.

XCrawl

PDF Extraction Ready

Direct PDF URLs and optional text extraction in JSON, streamlining paper scraping workflows for full-text analysis and archiving.

Adopté par des équipes data-driven du monde entier

Utilisé par les équipes analytics, recherche, veille & croissance.

XCrawl

Scrapers Arxiv Paper Scraper API disponibles

Accédez aux formats Arxiv Paper Scraper API les plus utilisés — structurés, normalisés, prêts pour la production.

Arxiv Paper Scraper

Extract full metadata for individual or bulk papers by ID or query.

Méthode de scraping :
  • paper_id
  • title
  • authors
  • abstract
  • summary
  • categories
  • pdf_url
  • published_date

Arxiv Search Scraper

Scrape paper results from keyword searches with relevance ranking.

Méthode de scraping :
  • query
  • paper_id
  • title
  • authors
  • score
  • date
  • abstract_snippet

Arxiv Category Scraper

Fetch latest papers from specific Arxiv categories and lists.

Méthode de scraping :
  • category
  • paper_id
  • title
  • authors
  • submission_date
  • pdf_url
  • subjects

Arxiv Author Scraper

Profile authors and their paper histories with bios and metrics.

Méthode de scraping :
  • author_name
  • affiliation
  • paper_count
  • papers
  • h_index
  • citations

Arxiv PDF Scraper

Download PDF links and extract media content from papers.

Méthode de scraping :
  • paper_id
  • pdf_url
  • images
  • figures
  • file_size
  • content_text

Arxiv Citation Scraper

Gather engagement metrics like citations and references.

Méthode de scraping :
  • paper_id
  • citing_papers
  • citation_count
  • references
  • impact_score

Méthodes de crawling Arxiv Paper Scraper API

XCrawl

API Scraping (pour les développeurs)

Seamlessly integrate our REST API for programmatic paper scraping in your backend workflows.

  • XCrawl
    Python SDK
    Async requests with our Python library for efficient, high-throughput paper scraping.
  • XCrawl
    Node.js Compatible
    Lightweight Node.js wrappers for real-time JSON responses and webhooks.
  • XCrawl
    Custom Endpoints
    Tailored API calls for bulk queries and structured dataset exports.
XCrawl

No-code Scraping (pour équipes ops & growth)

Leverage our dashboard for no-code paper scraping without writing a single line.

  • XCrawl
    Visual Selector
    Point-and-click to build queries for Arxiv papers and categories.
  • XCrawl
    Scheduled Runs
    Automate daily scrapes with cron-like scheduling and notifications.
  • XCrawl
    Export Options
    Download results as CSV, Excel, or JSON for instant analysis.

Exemples de code

Récupérez les posts et infos auteur Arxiv Paper Scraper API en quelques secondes par un simple appel API.

Entrée
Shell
curl -X POST https://xcrawl.com -H "Authorization: YOU_TOKEN" -H "Content-Type: application/json" -d "{\"geo\":\"US\",\"context\":{\"keyword_list\":[{\"keyword\":\"Apple\"}],\"start_page\":1,\"pages\":1},\"source\":\"amazon_search\"}"
Sortie
Json
{
"result":[
{
"content":{
"url":"https://www.amazon.com/s?k=Apple&page=1"
"page":1
"query":"Apple"
"results":{
"organic":[
{
"pos":1
"url":"https://www.amazon.com/sspa/click?ie=UTF8&spc=MTo1NTU4MDIyNzE4MTQ0NDk1OjE3NjM0NDg1NjM6c3BfYXRmOjMwMDg0MTIyMDE1MTYwMjo6MDo6&url=%2FApple-11-inch-Intelligence-Display-All-Day%2Fdp%2FB0DZ73HCJZ%2Fref%3Dsr_1_1_sspa%3Fdib%3DeyJ2IjoiMSJ9.34Y5eLJt-Syg--Dpi7ueLQwL3ml5AvPfvC0eh7LK2pKhXumC_HQT9LBvkLBiFSrOLyabiwA1DN0qC4nDUFqkGrn5VUhsdLQFYgZ3L8DIPuzIgdPdKtqxJq8diyjiiuXTCDm8kcQmj2lflrdB1g_13fvuEjweGI5mAVZVfJ83S_reyt11VBul7Fga7znbDIGVuFDGhy2lICifAICisiNT88x1w5OOasbBiPs42bcbX0Y.sYUV92XFy8V256YhUSF1FPnMdd_kkjo8lMeGBX4Y2Rs%26dib_tag%3Dse%26keywords%3DApple%26qid%3D1763448563%26sr%3D8-1-spons%26sp_csd%3Dd2lkZ2V0TmFtZT1zcF9hdGY%26psc%3D1"
"asin":"B0DZ73HCJZ"
"price":499.99
"title":"SponsoredSponsored You’re seeing this ad based on the product’s relevance to your search query.Leave ad feedback AppleiPad Air 11-inch with M3 chip Built for Apple Intelligence, Liquid Retina Display, 128GB, 12MP Front/Back Camera, Wi-Fi 6E, Touch ID, All-Day Battery Life — Purple"
"rating":4.8
"currency":"USD"
"is_prime":false
"url_image":"https://m.media-amazon.com/images/I/71b-vc2xzlL._AC_UY218_.jpg"
"best_seller":false
"price_upper":499.99
"is_sponsored":false
"sales_volume":"1K+ bought in past month"
"pricing_count":1
"reviews_count":null
"is_amazons_choice":false
"price_strikethrough":599
"shipping_information":"FREE delivery Sun, Nov 23Or fastest delivery Tomorrow, Nov 19"
},
{
"pos":2
"url":"https://www.amazon.com/sspa/click?ie=UTF8&spc=MTo1NTU4MDIyNzE4MTQ0NDk1OjE3NjM0NDg1NjM6c3BfYXRmOjMwMDg0MTI5NzA2MjkwMjo6MDo6&url=%2FApple-Bluetooth-Headphones-Personalized-Effortless%2Fdp%2FB0DGHMNQ5Z%2Fref%3Dsr_1_2_sspa%3Fdib%3DeyJ2IjoiMSJ9.34Y5eLJt-Syg--Dpi7ueLQwL3ml5AvPfvC0eh7LK2pKhXumC_HQT9LBvkLBiFSrOLyabiwA1DN0qC4nDUFqkGrn5VUhsdLQFYgZ3L8DIPuzIgdPdKtqxJq8diyjiiuXTCDm8kcQmj2lflrdB1g_13fvuEjweGI5mAVZVfJ83S_reyt11VBul7Fga7znbDIGVuFDGhy2lICifAICisiNT88x1w5OOasbBiPs42bcbX0Y.sYUV92XFy8V256YhUSF1FPnMdd_kkjo8lMeGBX4Y2Rs%26dib_tag%3Dse%26keywords%3DApple%26qid%3D1763448563%26sr%3D8-2-spons%26sp_csd%3Dd2lkZ2V0TmFtZT1zcF9hdGY%26psc%3D1"
"asin":"B0DGHMNQ5Z"
"price":117
"title":"SponsoredSponsored You’re seeing this ad based on the product’s relevance to your search query.Leave ad feedback AppleAirPods 4 Wireless Earbuds, Bluetooth Headphones, Personalized Spatial Audio, Sweat and Water Resistant, USB-C Charging Case, H2 Chip, Up to 30 Hours of Battery Life, Effortless Setup for iPhone"
"rating":4.5
"currency":"USD"
"is_prime":false
"url_image":"https://m.media-amazon.com/images/I/61iBtxCUabL._AC_UY218_.jpg"
"best_seller":false
"price_upper":117
"is_sponsored":false
"sales_volume":"10K+ bought in past month"
"pricing_count":1
"reviews_count":null
"is_amazons_choice":false
"price_strikethrough":129
"shipping_information":"FREE delivery Sun, Nov 23Or fastest delivery Tomorrow, Nov 19"
},
{
"pos":3
"url":"https://www.amazon.com/Apple-MX542LL-A-AirTag-Pack/dp/B0D54JZTHY/ref=sr_1_3?dib=eyJ2IjoiMSJ9.34Y5eLJt-Syg--Dpi7ueLQwL3ml5AvPfvC0eh7LK2pKhXumC_HQT9LBvkLBiFSrOLyabiwA1DN0qC4nDUFqkGrn5VUhsdLQFYgZ3L8DIPuzIgdPdKtqxJq8diyjiiuXTCDm8kcQmj2lflrdB1g_13fvuEjweGI5mAVZVfJ83S_reyt11VBul7Fga7znbDIGVuFDGhy2lICifAICisiNT88x1w5OOasbBiPs42bcbX0Y.sYUV92XFy8V256YhUSF1FPnMdd_kkjo8lMeGBX4Y2Rs&dib_tag=se&keywords=Apple&qid=1763448563&sr=8-3"
"asin":"B0D54JZTHY"
"price":79.98
"title":"AppleAirTag 4 Pack. Keep Track of and find Your Keys, Wallet, Luggage, Backpack, and More. Simple one-tap Set up with iPhone or iPad"
"rating":4.7
"currency":"USD"
"is_prime":false
"url_image":"https://m.media-amazon.com/images/I/61bMNCeAUAL._AC_UY218_.jpg"
"best_seller":false
"price_upper":79.98
"is_sponsored":false
"sales_volume":"10K+ bought in past month"
"pricing_count":1
"reviews_count":null
"is_amazons_choice":false
"price_strikethrough":99
"shipping_information":"FREE delivery Sun, Nov 23Or fastest delivery Tomorrow, Nov 19"
},
{
"pos":4
"url":"https://www.amazon.com/Apple-MX532LL-A-AirTag/dp/B0CWXNS552/ref=sr_1_4?dib=eyJ2IjoiMSJ9.34Y5eLJt-Syg--Dpi7ueLQwL3ml5AvPfvC0eh7LK2pKhXumC_HQT9LBvkLBiFSrOLyabiwA1DN0qC4nDUFqkGrn5VUhsdLQFYgZ3L8DIPuzIgdPdKtqxJq8diyjiiuXTCDm8kcQmj2lflrdB1g_13fvuEjweGI5mAVZVfJ83S_reyt11VBul7Fga7znbDIGVuFDGhy2lICifAICisiNT88x1w5OOasbBiPs42bcbX0Y.sYUV92XFy8V256YhUSF1FPnMdd_kkjo8lMeGBX4Y2Rs&dib_tag=se&keywords=Apple&qid=1763448563&sr=8-4"
"asin":"B0CWXNS552"
"price":17.97
"title":"AppleAirTag. Keep Track of and find Your Keys, Wallet, Luggage, Backpack, and More. Simple one-tap Set up with iPhone or iPad"
"rating":4.7
"currency":"USD"
"is_prime":false
"url_image":"https://m.media-amazon.com/images/I/71rP7f78eFL._AC_UY218_.jpg"
"best_seller":false
"price_upper":17.97
"is_sponsored":false
"sales_volume":"10K+ bought in past month"
"pricing_count":1
"reviews_count":null
"is_amazons_choice":false
"price_strikethrough":29
"shipping_information":"FREE delivery Sun, Nov 23 on $35 of items shipped by AmazonOr fastest delivery Tomorrow, Nov 19"
},
{
"pos":5
"url":"https://www.amazon.com/Apple-iPad-Pro-13-inch-M5/dp/B0FWCXMR3W/ref=sr_1_5?dib=eyJ2IjoiMSJ9.34Y5eLJt-Syg--Dpi7ueLQwL3ml5AvPfvC0eh7LK2pKhXumC_HQT9LBvkLBiFSrOLyabiwA1DN0qC4nDUFqkGrn5VUhsdLQFYgZ3L8DIPuzIgdPdKtqxJq8diyjiiuXTCDm8kcQmj2lflrdB1g_13fvuEjweGI5mAVZVfJ83S_reyt11VBul7Fga7znbDIGVuFDGhy2lICifAICisiNT88x1w5OOasbBiPs42bcbX0Y.sYUV92XFy8V256YhUSF1FPnMdd_kkjo8lMeGBX4Y2Rs&dib_tag=se&keywords=Apple&qid=1763448563&sr=8-5"
"asin":"B0FWCXMR3W"
"price":2499
"title":"AppleiPad Pro 13-inch (M5): Ultra Retina XDR Display, 2TB, 12MP Front/Back Camera, LiDAR Scanner, Wi-Fi 7 with Apple N1 + 5G Cellular with C1X chip, Face ID, All-Day Battery Life — Space Black"
"rating":4.6
"currency":"USD"
"is_prime":false
"url_image":"https://m.media-amazon.com/images/I/715V3wbnD6L._AC_UY218_.jpg"
"best_seller":false
"price_upper":2499
"is_sponsored":false
"sales_volume":null
"pricing_count":1
"reviews_count":16
"is_amazons_choice":false
"price_strikethrough":""
"shipping_information":"FREE delivery Sun, Nov 23Or fastest delivery Thu, Nov 20"
},
{
"pos":6
"url":"https://www.amazon.com/Apple-Cancellation-Translation-Headphones-High-Fidelity/dp/B0FQFB8FMG/ref=sr_1_6?dib=eyJ2IjoiMSJ9.34Y5eLJt-Syg--Dpi7ueLQwL3ml5AvPfvC0eh7LK2pKhXumC_HQT9LBvkLBiFSrOLyabiwA1DN0qC4nDUFqkGrn5VUhsdLQFYgZ3L8DIPuzIgdPdKtqxJq8diyjiiuXTCDm8kcQmj2lflrdB1g_13fvuEjweGI5mAVZVfJ83S_reyt11VBul7Fga7znbDIGVuFDGhy2lICifAICisiNT88x1w5OOasbBiPs42bcbX0Y.sYUV92XFy8V256YhUSF1FPnMdd_kkjo8lMeGBX4Y2Rs&dib_tag=se&keywords=Apple&qid=1763448563&sr=8-6"
"asin":"B0FQFB8FMG"
"price":249
"title":"AppleAirPods Pro 3 Wireless Earbuds, Active Noise Cancellation, Live Translation, Heart Rate Sensing, Hearing Aid Feature, Bluetooth Headphones, Spatial Audio, High-Fidelity Sound, USB-C Charging"
"rating":4.4
"currency":"USD"
"is_prime":false
"url_image":"https://m.media-amazon.com/images/I/61solmQSSlL._AC_UY218_.jpg"
"best_seller":false
"price_upper":249
"is_sponsored":false
"sales_volume":"10K+ bought in past month"
"pricing_count":1
"reviews_count":null
"is_amazons_choice":false
"price_strikethrough":""
"shipping_information":"FREE delivery Sun, Nov 23Or fastest delivery Tomorrow, Nov 19"
},
{
"pos":7
"url":"https://www.amazon.com/Apple-2025-MacBook-13-inch-Laptop/dp/B0DZD9S5GC/ref=sr_1_7?dib=eyJ2IjoiMSJ9.34Y5eLJt-Syg--Dpi7ueLQwL3ml5AvPfvC0eh7LK2pKhXumC_HQT9LBvkLBiFSrOLyabiwA1DN0qC4nDUFqkGrn5VUhsdLQFYgZ3L8DIPuzIgdPdKtqxJq8diyjiiuXTCDm8kcQmj2lflrdB1g_13fvuEjweGI5mAVZVfJ83S_reyt11VBul7Fga7znbDIGVuFDGhy2lICifAICisiNT88x1w5OOasbBiPs42bcbX0Y.sYUV92XFy8V256YhUSF1FPnMdd_kkjo8lMeGBX4Y2Rs&dib_tag=se&keywords=Apple&qid=1763448563&sr=8-7"
"asin":"B0DZD9S5GC"
"price":749.99
"title":"Apple2025 MacBook Air 13-inch Laptop with M4 chip: Built for Apple Intelligence, 13.6-inch Liquid Retina Display, 16GB Unified Memory, 256GB SSD Storage, 12MP Center Stage Camera, Touch ID; Midnight"
"rating":4.8
"currency":"USD"
"is_prime":false
"url_image":"https://m.media-amazon.com/images/I/71cWZUr9SVL._AC_UY218_.jpg"
"best_seller":false
"price_upper":749.99
"is_sponsored":false
"sales_volume":null
"pricing_count":1
"reviews_count":null
"is_amazons_choice":false
"price_strikethrough":999
"shipping_information":"FREE delivery Sun, Nov 23Or fastest delivery Tomorrow, Nov 19"
},
{
"pos":8
"url":"https://www.amazon.com/Apple-Headphones-Cancellation-Transparency-Personalized/dp/B0DGJ7HYG1/ref=sr_1_8?dib=eyJ2IjoiMSJ9.34Y5eLJt-Syg--Dpi7ueLQwL3ml5AvPfvC0eh7LK2pKhXumC_HQT9LBvkLBiFSrOLyabiwA1DN0qC4nDUFqkGrn5VUhsdLQFYgZ3L8DIPuzIgdPdKtqxJq8diyjiiuXTCDm8kcQmj2lflrdB1g_13fvuEjweGI5mAVZVfJ83S_reyt11VBul7Fga7znbDIGVuFDGhy2lICifAICisiNT88x1w5OOasbBiPs42bcbX0Y.sYUV92XFy8V256YhUSF1FPnMdd_kkjo8lMeGBX4Y2Rs&dib_tag=se&keywords=Apple&qid=1763448563&sr=8-8"
"asin":"B0DGJ7HYG1"
"price":148.99
"title":"AppleAirPods 4 Wireless Earbuds, Bluetooth Headphones, with Active Noise Cancellation, Adaptive Audio, Transparency Mode, Personalized Spatial Audio, USB-C Charging Case, Wireless Charging, H2 Chip"
"rating":4.5
"currency":"USD"
"is_prime":false
"url_image":"https://m.media-amazon.com/images/I/61iBtxCUabL._AC_UY218_.jpg"
"best_seller":false
"price_upper":148.99
"is_sponsored":false
"sales_volume":"10K+ bought in past month"
"pricing_count":1
"reviews_count":null
"is_amazons_choice":false
"price_strikethrough":179
"shipping_information":"FREE delivery Sun, Nov 23Or fastest delivery Tomorrow, Nov 19"
},
],
"amazons_choices":[
],
},
},
},
],
},

Comment fonctionne l’API Scraper Arxiv Paper Scraper API ?

  • XCrawlRotation IP intelligente
  • XCrawlReconnaissance CAPTCHA automatique
  • XCrawlEntêtes HTTP
  • XCrawlParsing automatique des pages
  • XCrawlSupport personnalisable

Que peut faire notre API pour vous ?

XCrawl

Gestion des proxies

Sélection et rotation ML des proxies à partir de notre pool premium de 190 pays.

XCrawl

Empreintes pilotées par l’IA

Entêtes HTTP, JS et empreintes navigateur uniques pour résister aux contenus dynamiques.

XCrawl

Bypass CAPTCHA

Relances et contournement CAPTCHA automatiques pour une collecte ininterrompue.

XCrawl

Extraction de données en masse

Extrayez sur plusieurs pages en même temps, jusqu’à 10k URLs par lot.

XCrawl

Options de livraison multiples

Recevez vos données via SFTP, AWSS3 ou récupérez-les via API.

XCrawl

Scraping programmé

Définissez votre fréquence souhaitée d’extraction automatisée, livrée directement sur votre cloud storage.

XCrawl

Infrastructure sans maintenance

Supprimez les soucis de proxies et d’infrastructure. Plus besoin de bâtir des systèmes de crawler.

XCrawl

Très évolutif

Intégration simple et support de personnalisation.

XCrawl

Support 24/7

Profitez d’un support professionnel pour toute question ou problème.

XCrawl Transparent

Tarification flexible

Tarification transparente de web scraping avec des plans d'abonnement API flexibles. Comparez les coûts d'extraction de données, achetez l'accès crawler et commencez gratuitement — puis évoluez à votre rythme.

Mensuel
Annuel Populaire

Formules évolutives

Formules haut volume pour les équipes en quête de puissance et de support dédié.

Profitez de limites de débit plus élevées, plus de navigateurs concurrents et d’un support prioritaire.

Contacter le service commercial
Nous fournissons des solutions sur mesure d’envergure entreprise

Découvrez d’autres solutions

N
NHTSA Vehicle Recalls Intelligence Scraper API

XCrawl's NHTSA Vehicle Recalls Intelligence Scraper API revolutionizes vehicles scraping by providing instant access to comprehensive recall data. Our robust vehicle scraper bypasses rate limits, IP blocks, and parsing complexities, delivering clean JSON for backend developers building safety intelligence tools.

En savoir plus
b
bitcoin-price-predictor Scraper API

XCrawl's bitcoin-price-predictor Scraper API is the premier price scraper API for extracting real-time bitcoin prices, predictions, and historical data. Bypass parsing headaches and IP blocks with our robust price scraping service, delivering clean JSON for seamless integration into your price monitoring or trading apps.

En savoir plus
H
Hellowork Jobs Search Scraper API

XCrawl's Hellowork Jobs Search Scraper API is the ultimate job web scraper for extracting real-time job listings from Hellowork. Bypass complex parsing hurdles in scraping job sites with our robust job scraper API, delivering clean JSON data for job scraping tools, job search API integrations, and seamless data extraction jobs without IP blocks or CAPTCHAs.

En savoir plus
Y
YouTube Subtitles Scraper API

Harness the power of our YouTube Subtitles Scraper API to effortlessly extract captions, transcripts, and metadata from YouTube videos. Bypass rate limits, parsing headaches, and IP blocks with this robust youtube scraper api, delivering clean JSON data for youtube data scraping, search results, and more—ideal for developers building youtube scraping python tools.

En savoir plus
Y
Youtube Email Scraper - Advanced, Fast And Cheapest Scraper API

XCrawl's Youtube Email Scraper API is the advanced, fast, and cheapest youtube scraper API for extracting emails from YouTube channels, video descriptions, comments, and search results. Bypass rate limits and IP blocks with rotating proxies, parse complex data effortlessly, and receive clean JSON outputs via our youtube scraping api for seamless integration in your email scraping workflows.

En savoir plus
G
Google Maps Places Scraper API

XCrawl's Google Maps Places Scraper API empowers developers to scrape google maps data effortlessly, bypassing IP blocks and complex parsing challenges. Extract business listings, reviews, and location details via a reliable google maps scraper API, delivering clean JSON from google places api endpoints without the hassle of proxies or CAPTCHAs.

En savoir plus

Que disent nos clients ?

★★★★★
5.0

The Arxiv paper scraping API transformed our dataset building—fast, accurate, and easy Python integration.

Dr. Elena Vasquez
Dr. Elena Vasquez
AI Research Lead
★★★★★
4.9

Perfect for bulk paper scraping; JSON output is pristine for citation analysis projects.

Prof. Mark Chen
Prof. Mark Chen
Data Scientist
★★★★★
5.0

Saved weeks on literature scraping—reliable PDF links and metadata every time.

Sarah Lin
Sarah Lin
ML Engineer
★★★★★
4.8

Outstanding paper scraping speed and dataset quality for our trend monitoring tool.

Dr. Raj Patel
Dr. Raj Patel
Academic Analyst
★★★★★
5.0

Seamless API integration for Arxiv search scraping—handles scale effortlessly.

Lisa Wong
Lisa Wong
Backend Developer
★★★★★
4.9

No-code dashboard makes paper scraping accessible; exports are spot-on.

Tom Rivera
Tom Rivera
Research Ops
★★★★★
5.0

Accurate author and category data via this paper scraping powerhouse.

Anna Kowalski
Anna Kowalski
PhD Candidate
★★★★★
4.7

Ideal for real-time paper scraping into our training pipelines.

David Kim
David Kim
NLP Specialist
★★★★★
5.0

Robust against blocks; best tool for scalable Arxiv datasets.

Maria Gonzalez
Maria Gonzalez
DevOps Engineer
★★★★★
4.9

Citation scraping is precise—game-changer for impact analysis.

James O'Brien
James O'Brien
Quant Researcher
★★★★★
5.0

The Arxiv paper scraping API transformed our dataset building—fast, accurate, and easy Python integration.

Dr. Elena Vasquez
Dr. Elena Vasquez
AI Research Lead
★★★★★
4.9

Perfect for bulk paper scraping; JSON output is pristine for citation analysis projects.

Prof. Mark Chen
Prof. Mark Chen
Data Scientist
★★★★★
5.0

Saved weeks on literature scraping—reliable PDF links and metadata every time.

Sarah Lin
Sarah Lin
ML Engineer
★★★★★
4.8

Outstanding paper scraping speed and dataset quality for our trend monitoring tool.

Dr. Raj Patel
Dr. Raj Patel
Academic Analyst
★★★★★
5.0

Seamless API integration for Arxiv search scraping—handles scale effortlessly.

Lisa Wong
Lisa Wong
Backend Developer
★★★★★
4.9

No-code dashboard makes paper scraping accessible; exports are spot-on.

Tom Rivera
Tom Rivera
Research Ops
★★★★★
5.0

Accurate author and category data via this paper scraping powerhouse.

Anna Kowalski
Anna Kowalski
PhD Candidate
★★★★★
4.7

Ideal for real-time paper scraping into our training pipelines.

David Kim
David Kim
NLP Specialist
★★★★★
5.0

Robust against blocks; best tool for scalable Arxiv datasets.

Maria Gonzalez
Maria Gonzalez
DevOps Engineer
★★★★★
4.9

Citation scraping is precise—game-changer for impact analysis.

James O'Brien
James O'Brien
Quant Researcher
★★★★★
5.0

The Arxiv paper scraping API transformed our dataset building—fast, accurate, and easy Python integration.

Dr. Elena Vasquez
Dr. Elena Vasquez
AI Research Lead
★★★★★
4.9

Perfect for bulk paper scraping; JSON output is pristine for citation analysis projects.

Prof. Mark Chen
Prof. Mark Chen
Data Scientist
★★★★★
5.0

Saved weeks on literature scraping—reliable PDF links and metadata every time.

Sarah Lin
Sarah Lin
ML Engineer
★★★★★
4.8

Outstanding paper scraping speed and dataset quality for our trend monitoring tool.

Dr. Raj Patel
Dr. Raj Patel
Academic Analyst
★★★★★
5.0

Seamless API integration for Arxiv search scraping—handles scale effortlessly.

Lisa Wong
Lisa Wong
Backend Developer
★★★★★
4.9

No-code dashboard makes paper scraping accessible; exports are spot-on.

Tom Rivera
Tom Rivera
Research Ops
★★★★★
5.0

Accurate author and category data via this paper scraping powerhouse.

Anna Kowalski
Anna Kowalski
PhD Candidate
★★★★★
4.7

Ideal for real-time paper scraping into our training pipelines.

David Kim
David Kim
NLP Specialist
★★★★★
5.0

Robust against blocks; best tool for scalable Arxiv datasets.

Maria Gonzalez
Maria Gonzalez
DevOps Engineer
★★★★★
4.9

Citation scraping is precise—game-changer for impact analysis.

James O'Brien
James O'Brien
Quant Researcher
★★★★★
5.0

The Arxiv paper scraping API transformed our dataset building—fast, accurate, and easy Python integration.

Dr. Elena Vasquez
Dr. Elena Vasquez
AI Research Lead
★★★★★
4.9

Perfect for bulk paper scraping; JSON output is pristine for citation analysis projects.

Prof. Mark Chen
Prof. Mark Chen
Data Scientist
★★★★★
5.0

Saved weeks on literature scraping—reliable PDF links and metadata every time.

Sarah Lin
Sarah Lin
ML Engineer
★★★★★
4.8

Outstanding paper scraping speed and dataset quality for our trend monitoring tool.

Dr. Raj Patel
Dr. Raj Patel
Academic Analyst
★★★★★
5.0

Seamless API integration for Arxiv search scraping—handles scale effortlessly.

Lisa Wong
Lisa Wong
Backend Developer
★★★★★
4.9

No-code dashboard makes paper scraping accessible; exports are spot-on.

Tom Rivera
Tom Rivera
Research Ops
★★★★★
5.0

Accurate author and category data via this paper scraping powerhouse.

Anna Kowalski
Anna Kowalski
PhD Candidate
★★★★★
4.7

Ideal for real-time paper scraping into our training pipelines.

David Kim
David Kim
NLP Specialist
★★★★★
5.0

Robust against blocks; best tool for scalable Arxiv datasets.

Maria Gonzalez
Maria Gonzalez
DevOps Engineer
★★★★★
4.9

Citation scraping is precise—game-changer for impact analysis.

James O'Brien
James O'Brien
Quant Researcher
ISO 27001
XCrawlISO 27001
CDPR
XCrawlCDPR
Mieux noté par les utilisateurs
XCrawlMieux noté par les utilisateurs
Leader
XCrawlLeader
Plus facile à utiliser
XCrawlPlus facile à utiliser
Prix Meilleur Rapport Qualité
XCrawlPrix Meilleur Rapport Qualité

Questions fréquentes

Tout ce que vous devez savoir sur XCrawl.

How does the Arxiv Paper Scraper API architecture work?
Our distributed crawlers fetch public Arxiv pages, parse content with AI-enhanced extractors, and return structured JSON via REST endpoints.
What factors determine the pricing model?
Pricing scales with API calls, data volume scraped, success rate, and optional premium features like PDF text extraction.
What data coverage and limitations apply?
Full coverage of public papers, metadata, and categories; minor delays for embargoed new submissions, no private content.
Is the paper scraping compliant and legal?
We scrape only publicly available data in line with Arxiv's terms of use and robots.txt, ensuring ethical compliance.
What integration support is available?
Comprehensive docs, Python/Node SDKs, code samples, and Slack support for custom paper scraping setups.

Obtenez les données dont vous avez besoin.

Laissez-nous gérer la collecte des données pendant que vous vous concentrez sur votre travail.

Commencer gratuitement