Files
gists/tools/data/geturls.py
tobias 619b0bc432 Restructure repository: organize tools by purpose, create what search tool
- Move single-file tools to tools/ organized by category (security, forensics, data, etc.)
- Move multi-file projects to projects/ (go-tools, puzzlebox, timesketch, rust-tools)
- Move system scripts to scripts/ (proxy, display, setup, windows)
- Organize config files in config/ (shell, visidata, applications)
- Move experimental tools to archive/experimental
- Create 'what' fuzzy search tool with progressive enhancement (ollama->fzf->grep)
- Add initial metadata database for intelligent tool discovery
- Preserve git history using 'git mv' commands
2026-02-21 23:20:42 +01:00

32 lines
746 B
Python
Executable File

#!/usr/bin/env python3
import sys
from bs4 import BeautifulSoup
if sys.argv[1].startswith("http://") or sys.argv[1].startswith("https://"):
import requests
response = requests.get(sys.argv[1])
data = response.content
else:
with open(sys.argv[1],'rt',encoding='ISO-8859-1') as f:
data=f.read()
page=str(BeautifulSoup(data,features="lxml"))
def getURL(page):
start_link = page.find("a href")
if start_link == -1:
return None, 0
start_quote = page.find('"', start_link)
end_quote = page.find('"', start_quote + 1)
url = page[start_quote + 1: end_quote]
return url, end_quote
while True:
url, n = getURL(page)
page = page[n:]
if url:
print(url)
else:
break