MCPcopy
hub / github.com/arxhr007/Aliens_eye

github.com/arxhr007/Aliens_eye @v2.2.1 sqlite

repository ↗ · DeepWiki ↗ · release v2.2.1 ↗
320 symbols 1,370 edges 47 files 63 documented · 20%
README

ALIENS EYE

Aliens Eye Logo

AI-OSINT Username Scanner

Advanced AI-Powered Social Media Username Finder

Scan 840+ platforms with ML-blended detection

PyPI CI Python Stars License

Highlights

  • 840+ platforms scanned asynchronously in seconds
  • ML + heuristic detection — a trained model blended with 30 structural signals (HTTP status, DOM shape, keywords, fingerprints) instead of naive status-code checks
  • Profile extraction — display name, bio, and avatar pulled from each hit (OpenGraph / JSON-LD / per-site CSS)
  • Cross-site correlation — cluster profiles that look like the same person by avatar hash, bio, shared links, and name (--correlate)
  • Recursive expansion — follow linked usernames out of bios and re-scan them (--recurse-depth N)
  • Domain check — is <username>.{com,io,net,…} registered and live? (--domains)
  • Watch mode — re-scan on an interval and alert on changes, optionally to a webhook (--watch 6h --notify <url>)
  • Resumable scans — checkpoint progress and continue after an interruption (--resume file.jsonl)
  • Modern terminal UI — live progress, sorted result tables, summary panels (powered by rich); plus an interactive browser (aliens_eye tui, optional extra)
  • MCP server — expose scanning to LLM agents (aliens_eye serve, optional extra)
  • Proxy & Tor support--proxy socks5://... or just --tor
  • Site filtering--site github,reddit, --exclude-site, --no-nsfw, plus drop-in sites.d/ plugin site maps
  • Calibrated self-checkaliens_eye selfcheck reports precision / recall / F1 / FPR per site
  • Retrainable + active learning — retrain with aliens_eye train, or hand-label uncertain hits with aliens_eye label
  • Reports in JSON, CSV, HTML, Markdown, PDF, and graph formats (GEXF, Mermaid, Maltego CSV)
  • Playwright fallback for JavaScript-heavy pages (optional extra)

Install

pip install aliens-eye

Optional extras:

pip install "aliens-eye[browser]"   # Playwright fallback for hard pages
python -m playwright install chromium

pip install "aliens-eye[train]"     # scikit-learn, for retraining the ML model
pip install "aliens-eye[correlate]" # Pillow, for avatar-image matching in --correlate
pip install "aliens-eye[pdf]"       # reportlab, for --format pdf
pip install "aliens-eye[tui]"       # textual, for the interactive `tui` browser
pip install "aliens-eye[serve]"     # mcp, for the `serve` MCP server

Or with Docker:

docker build -t aliens-eye .
docker run --rm -it aliens-eye username

From source:

git clone https://github.com/arxhr007/Aliens_eye.git
cd Aliens_eye
pip install -e .

Usage

# Interactive prompts
aliens_eye

# Single username
aliens_eye username

# Multiple usernames
aliens_eye username1 username2

# Advanced scan level (prefix/suffix variations)
aliens_eye username -l advanced

# Only scan specific sites
aliens_eye username --site github,reddit,gitlab

# Skip NSFW sites
aliens_eye username --no-nsfw

# Route through Tor (needs a local Tor daemon)
aliens_eye username --tor

# Any HTTP or SOCKS proxy
aliens_eye username --proxy socks5://127.0.0.1:1080

# Export everything
aliens_eye username --format all --output results

# Heuristics only, no ML
aliens_eye username --no-ml

# Non-interactive preset: quick / full / aggressive
aliens_eye username --profile quick

# Plain output for scripts and CI (no colors/progress)
aliens_eye username --plain

# View results from a previous scan
aliens_eye -r results/username_advanced_20260611_120000.json

# Correlate hits into "likely same person" clusters + check domains
aliens_eye username --correlate --domains

# Follow linked usernames out of found bios and re-scan them
aliens_eye username --recurse-depth 1

# Export a graph of the results (import into Gephi / Maltego / Mermaid)
aliens_eye username --correlate --format gexf,mermaid,maltego

# Investigator PDF with embedded avatars
aliens_eye username --format pdf

# Watch for changes every 6 hours and POST them to a webhook
aliens_eye username --watch 6h --notify https://hooks.example/aliens

# Resume an interrupted scan
aliens_eye username --resume scan.jsonl

# Compare two saved reports
aliens_eye diff results/old.json results/new.json

# Validate detection accuracy (precision / recall / F1 per site)
aliens_eye selfcheck --negatives 2 --report json

# Interactively label uncertain hits into a training set
aliens_eye label results/username_basic_20260611_120000.json --out labeled.csv

# Interactive terminal browser (needs [tui])
aliens_eye tui username

# Run the MCP server for LLM agents (needs [serve])
aliens_eye serve

Custom platforms: drop a { "site_name": "https://site/{}" } JSON file into ./sites.d/ (or the user config dir's sites.d/) and it is merged automatically; --sites-dir DIR adds another location.

How detection works

Every response is converted into a 30-dimensional feature vector: HTTP status buckets, username placement (path/title/meta/canonical), error and profile keywords, DOM structure (images, forms, profile/error CSS classes), structured-data signals (og:type, JSON-LD Person), response timing, redirect counts, and per-site fingerprint matches learned from previous scans.

Two judges then vote:

  1. Heuristic engine — weighted scoring over the features
  2. ML model — logistic regression trained on labeled scans of real (and deliberately fake) accounts, shipped with the package and running in pure Python (no sklearn needed at runtime)

The blended probability maps to Found / Maybe / Not Found with a confidence percentage. If a model file is missing or invalid, the scanner silently falls back to heuristics.

Retraining the model

pip install "aliens-eye[train]"

# 1. Scan ground-truth accounts + random non-existent usernames to build a dataset
aliens_eye train collect --out dataset.csv --negatives 4

# 2. Fit and export the model
aliens_eye train fit --data dataset.csv --out model.json

# 3. Use it
aliens_eye username --model model.json

Configuration

Aliens Eye merges a JSON config file with CLI flags (CLI wins). Search order without --config: ./config.json, then the platform config dir (e.g. ~/.config/aliens_eye/config.json on Linux, %LOCALAPPDATA%\aliens_eye on Windows).

{
  "concurrent": 50,
  "timeout": 10.0,
  "retries": 2,
  "rate_limit_delay": 0.2,
  "output_dir": "results",
  "output_formats": ["json", "csv", "html", "md"],
  "use_playwright": false,
  "proxy": null,
  "use_ml": true,
  "exclude_nsfw": false,
  "level": "basic"
}

Outputs

Results are saved with timestamped filenames:

  • username_level_YYYYMMDD_HHMMSS.json — full detail including per-site feature analysis
  • .csv — flat rows for spreadsheets
  • .html — styled standalone report
  • .md — Markdown summary of Found/Maybe hits

Architecture

The package lives under src/aliens_eye/: core/ (scanner, detector, analyzer, http, exporter, fingerprints), ml/ (inference, training, dataset collection), utils/ (rich console layer), and data/ (sites.json, trained model, ground-truth sets). For internals and flowcharts, see WORKING.md.

Contributing

Issues and PRs welcome — adding sites to src/aliens_eye/data/sites.json, expanding the ground-truth set in selfcheck.json, or improving the model all directly improve detection. Run pytest and ruff check src tests before submitting.

Disclaimer

This tool is for educational purposes and legitimate OSINT research only. You are responsible for complying with laws and site terms of service.

Core symbols most depended-on inside this repo

get_console
called by 17
src/aliens_eye/utils/console.py
save_results
called by 10
src/aliens_eye/core/exporter.py
predict_proba
called by 10
src/aliens_eye/ml/inference.py
predict
called by 9
src/aliens_eye/core/detector.py
load_sites_data
called by 9
src/aliens_eye/core/scanner.py
set_plain
called by 8
src/aliens_eye/utils/console.py
from_dict
called by 8
src/aliens_eye/ml/inference.py
setup_logger
called by 7
src/aliens_eye/utils/logger.py

Shape

Function 222
Method 78
Class 20

Languages

Python100%

Modules by API surface

src/aliens_eye/cli.py31 symbols
src/aliens_eye/core/correlate.py22 symbols
tests/test_new_features.py19 symbols
src/aliens_eye/core/exporter.py16 symbols
src/aliens_eye/tui/app.py14 symbols
src/aliens_eye/core/scanner.py14 symbols
tests/test_scanner.py13 symbols
src/aliens_eye/core/fingerprints.py12 symbols
tests/test_ml_inference.py11 symbols
tests/test_detector.py11 symbols
src/aliens_eye/utils/console.py11 symbols
tests/test_config_cli.py10 symbols

Dependencies from manifests, versioned

aiohttp3.8 · 1×
platformdirs4.0 · 1×
rich13.7 · 1×
selectolax0.3.21 · 1×

For agents

$ claude mcp add Aliens_eye \
  -- python -m otcore.mcp_server <graph>

⬇ download graph artifact