diff --git a/scipaperloader/templates/index.html.jinja b/scipaperloader/templates/index.html.jinja index c098dfd..c9ee6b7 100644 --- a/scipaperloader/templates/index.html.jinja +++ b/scipaperloader/templates/index.html.jinja @@ -7,78 +7,140 @@ {% block content %} - -{% include "partials/flash_messages.html.jinja" %} - -
+

Welcome to SciPaperLoader

-

Your paper scraping tool is ready.

-

A simple tool to scrape papers from Zotero API.

+

Your comprehensive paper management and scraping platform

+

Automate paper collection, manage metadata, and monitor download progress with intelligent + scheduling

-
-
-
+ +
+
+

🚀 Core Features

+
+ +
+
-
📄 CSV Import
+
🎛️ Scraper Control Panel

- Upload a 37-column CSV to import paper metadata. Only relevant fields - (title, DOI, ISSN, etc.) are stored. Errors are reported without - aborting the batch. + Start, pause, and monitor the automated paper scraping process. View real-time statistics, + activity charts, and process individual papers on demand.

- Upload Now + Open Control Panel
-
-
-
-
🧠 Background Scraper
-

- A daemon process runs hourly to fetch papers using Zotero API. - Downloads are randomized to mimic human behavior and avoid detection. -

- View Logs -
-
-
- -
-
+
+
📚 Paper Management

- Monitor paper status (Pending, Done, Failed), download PDFs, and - inspect errors. Files are stored on disk in structured folders per - DOI. + Browse, search, and manage your paper collection. View download status, + inspect metadata, export data, and handle failed downloads.

- Browse Papers + Browse Papers
-
-
+
+
-
🕒 Download Schedule
+
📄 CSV Data Import

- Control how many papers are downloaded per hour. Configure hourly - volume (e.g. 2/hour at daytime, 0 at night) to match your bandwidth or - usage pattern. + Bulk import paper metadata from CSV files. Supports 37-column format with + intelligent duplicate detection and comprehensive error reporting.

- Adjust Schedule + Import Data +
+
+
+
+ + +
+
+

⚙️ Configuration & Monitoring

+
+ +
+
+
+
🕒 Download Scheduling
+

+ Configure hourly download quotas and timing patterns. Set different rates for + day/night hours to optimize bandwidth usage and avoid detection. +

+ Manage Schedule +
+
+
+ +
+
+
+
🔧 System Configuration
+

+ Adjust global settings including daily volume limits, download paths, + and scraper module selection for optimal performance. +

+ System Settings +
+
+
+ +
+
+
+
📊 Activity Logs
+

+ Monitor system activity, track scraping progress, and troubleshoot issues + with comprehensive logging and activity timeline views. +

+ View Logs +
+
+
+
+ + +
+
+

🔬 Advanced Features

+
+ +
+
+
+
🗄️ Database Management
+

+ Manage your paper database with tools for generating test data, + cleaning up records, and database maintenance operations. +

+ Database Tools +
+
+
+ +
+
+
+
🧠 Intelligent Processing
+

+ Background daemon with randomized timing, human-like behavior patterns, + and automatic retry mechanisms for robust paper collection. +

+
+ Auto-Retry + Smart Timing + Rate Limiting +
- {% endblock content %} \ No newline at end of file