news-please - an integrated web crawler and information extractor for news that just works
-
Updated
Sep 21, 2025 - Python
news-please - an integrated web crawler and information extractor for news that just works
Open source toolkit for scraping, OSINT and more.
Portal Tutorial
Python async data gathering
The Job Crawler is an integral component of the Job RAID project, designed to automatically scrape and collect data from various job listing websites. This crawler enables Job RAID to aggregate comprehensive job listings, ensuring that users have access to up-to-date and relevant job opportunities.
Data gathering from https://cafebazaar.ir
An application to watch the Twitter stream and send accounts to the Botometer API for analysis. The results are stored in a SQLite database.
Code and slides for my class: Data Gathering & Wrangling
ML/DL dataset collection utilities
Foreground application logger for Windows
Financial Datareader
Face Detection => Data Gathering => Training => Face Recognition
This is the project for Karnataka Police Hackathon
Exports IPMI sensor information to a CSV
Minecraft Server Finder is a small toolkit which helps in finding Minecraft servers and tracking players using the "sample" parameter.
A set of python utilities to automatically exfiltrate system data.
This is a collection of scripts used to semi-automate the collecting of traffic data over Mullvad VPN, using Ubuntu virtual machines generating the traffic.
ROSE-AP of the FlexHex project tries to make data gathering from Orion Context Broker entities to Influx-db easier.
BUM (Bayesian User Model): A User Modelling Technique for Learning from Distributed Devices.
Add a description, image, and links to the data-gathering topic page so that developers can more easily learn about it.
To associate your repository with the data-gathering topic, visit your repo's landing page and select "manage topics."