Multilingual Corpus
Download the complete philosophical and cultural writings:
Structured Data
Hugging Face: JSON + DOLMA format, ready for ML training
Kaggle: CSV format, optimized for data science
Websites Archive
GitHub: HTML + CSS + PDF + images, full website dump
License
License Page: CC BY 4.0, use freely, even commercially