Machine learning
Pytorch Graph Neural Networks
PyTorch Geometric is an extension of PyTorch to train GNNs on structured data. Their introduction by example page and list of collab notebooks (and associated videos) are very pedagogic and interesting to experiment with.
Cybersecurity
Conferences deadlines
This website summarize the upcoming cybersecurity main conferences with a countdown to their submission deadlines: sec-deadlines.
SQL Injection datasets
For training of SQL Injection detection classification models, the following datasets can be used:
- WAF-A-MOLE Labeled SQL statements dataset used in paper “WAF-A-MoLE: Evading Web Application Firewalls through Adversarial Machine Learning”.
- Kaggle SQL Injection labeled dataset. This is the cleaned version of dataset which is also widely used in the domain.
- Libinjection malicious payloads dataset.
CTFs
Some random tools.
- Webservices / social medias account finder: sherlock
- PrivEsc vulnerability scanner: linPEAS
- Online image analysis: Aperisolve
- SQL Injection detection: sqlmap, OWASP Zap
Software engineering:
- Commit messages specification: conventionalcommits
- Word, Excel, Powerpoint to markdown: markitdown
- A static analysis tool: semgrep
OSS
PDF Manipulation tool:
Any administrative procedure requires to manipulate (merge, sign, reorder…) pdf files. I do not feel confident on relying on an online service, especially when we are talking about sensitive information. Stirling-PDF seems like the perfect tool to be used (they provide a docker instance) to manipulate these files. In a rush or not able to start a local instance ? Framalab is providing an instance: https://stirling-pdf.framalab.org/.