Instructions
- A fully qualified domain name (FQDN), sometimes called an absolute domain name, consists of a list of domain labels representing the hierarchy from the lowest relevant level in the DNS to the top-level domain (TLD), all concatenated with a dot. In DNS zone files, a domain name must end with a dot to be resolved as a valid FQDN.
- On the other hand, a fully qualified URL (FQU) is an absolute URL that starts with a valid scheme, like http(s), and has a complete path of directories, if any, to a file. Like FQDNs, FQUs can only be interpreted in one way.
- Some times you just want to extract FQUs from an initial FQU, a piece of text, or from the source code of a file residing in your local machine. The file may consists of an email, blog, forum, twitter, or facebook message, or can be an htm(l), asp(x), doc(x), js, css, txt, or php file.
- On occasions, your target might reside in a remote server or another machine. Said page might be a search results page, an online catalog, a link hub page, or one from your competitor or prospective client. This tools can help you to extract FQUs from these sources.
- If a remote host or search engine blocks, limits, or obfuscates your request or the results, visit the target URL and submit the visible text or source code associated to said web address.
- To export results, click the tool output textarea. Copy/paste them as you usually would.
- Disclaimer Notice. To prevent abuses, we have limited the tool to the processing of the first 100,000 text characters.
Extracting FQUs from regular text
- Submit the text that you want to extract FQUs from.
- This can be the visible text of a web page, email message, Word document, or text file.
- The text can also be the source code of a web page or email message.
- You can also type in the text you are interested in analyzing.
Extracting FQUs from an initial FQU
- To extract FQUs from a previous one, submit the FQU you are interested in. Submit one web address at a time; otherwise the tool may not recognize your request.
- The web address you submit can be the FQU of a web site, article, blog, or search results page, among others.
Getting no results
- If you get no results, it may mean that the result is not available or that the target
- does not accept your request.
- accepts your request, but redirects you.
- accepts your request, but obfuscates its content.
- allows your request, but has no URLs.
- has relative URLs, but no absolute URLs.
- has other conditions that prevents crawling.
- No results might also be the result of a query typo.
Who can use it?
- Data miners, teachers, students, or anyone interested in extracting absolute URLs from online or offline documents.
Suggested Exercises
- Submit an initial URL or source code of a web page known to list absolute URLs.
- Extract FQUs from an email message by submitting its visible text or source code.
- Extract FQUs from a search results page or remote web address, like Bing, Google, DMOZ, your own site, your competitor's site, even this site, etc.
Feedback
Contact us for any suggestion or question regarding this tool.