Monday, 29 June 2015

Wget - copy Website - Windows

How to get Wget?


Open the terminal.
In Windows, press the Windows key + R, type "cmd" and press Enter.

Download HTML pages:

This is the command for Windows.

wget -r -p --no-parent --convert-links -l3 -Pdownload http://www.uni-tuebingen.de/

-r = Turn on recursive retrieval. Without it, wget fetches only the given page and its requisites, and -l and --no-parent have no effect.
-p = Download all the files that are necessary to properly display a given HTML page (images, stylesheets and so on).
--no-parent = Never ascend to the parent directory when retrieving recursively. This guarantees that only the files below a certain hierarchy are downloaded. Important for TYPO3 websites.
--convert-links = Convert the links in the downloaded files so that they point to the local files.
-l3 = Maximum recursion depth, here 3 levels.
-Pdownload = Directory prefix: save the files in the folder "download" below the current directory. For example, -Pc:\temp\test saves the files in c:\temp\test.
URL = The website to copy, e.g. http://www.uni-tuebingen.de/
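
If the same download is needed more than once, the options above can be assembled by a small script. The following is only a sketch in Python, not part of wget. The function name build_wget_command and its parameters are made up for this example, and -r is added on the assumption that recursion is wanted, because -l and --no-parent only take effect together with it.

```python
import subprocess


def build_wget_command(url, target_dir="download", depth=3):
    """Return the wget argument list for copying a site into target_dir."""
    return [
        "wget",
        "-r",               # assumption: recursion is wanted, so -l and --no-parent apply
        "-p",               # also fetch the images, CSS etc. needed to display each page
        "--no-parent",      # never ascend above the start directory
        "--convert-links",  # rewrite links so they point to the local copies
        f"-l{depth}",       # maximum recursion depth
        f"-P{target_dir}",  # directory prefix for the downloaded files
        url,
    ]


def run_wget(command):
    """Start the download; raises CalledProcessError if wget reports a failure."""
    subprocess.run(command, check=True)


command = build_wget_command("http://www.uni-tuebingen.de/")
# Print the command first so it can be checked before anything is downloaded.
print(" ".join(command))
```

Calling run_wget(command) afterwards starts the actual download, provided wget is on the PATH.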

Also try
--mirror
Turn on options suitable for mirroring. This option turns on recursion and time-stamping, sets infinite recursion depth and keeps FTP directory listings. It is currently equivalent to ‘-r -N -l inf --no-remove-listing’. 
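
To make the equivalence visible, here is a small Python sketch that replaces --mirror (or its short form -m) in an argument list with the options quoted above. The helper expand_mirror is hypothetical and exists only for this illustration; wget does the expansion internally.

```python
# Options that --mirror stands for, as quoted from the wget manual above.
MIRROR_EQUIVALENT = ["-r", "-N", "-l", "inf", "--no-remove-listing"]


def expand_mirror(args):
    """Return a copy of args with --mirror / -m replaced by the long form."""
    expanded = []
    for arg in args:
        if arg in ("--mirror", "-m"):
            expanded.extend(MIRROR_EQUIVALENT)
        else:
            expanded.append(arg)
    return expanded


short_form = ["wget", "--mirror", "--no-parent", "--convert-links",
              "-Pdownload", "http://www.uni-tuebingen.de/"]
print(" ".join(expand_mirror(short_form)))
```

Both forms should behave the same. Note that --mirror sets an infinite recursion depth, so --no-parent is worth keeping to limit what is fetched.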

Download pictures

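wget -r -A .jpg,.png,.gif -p --no-parent --convert-links -l3 -Pdownload http://www.uni-tuebingen.de/

-A = Accept list: comma-separated file name suffixes (or patterns) to download. Files that do not match are discarded.
-r = Follow links recursively, so that pictures on linked pages are found as well.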

Also useful
--ignore-case = Ignore case when matching the accept list, so that both .JPG and .jpg are found.
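
The effect of the accept list and of --ignore-case can be pictured with a few lines of Python. This is a simplified model and not wget's actual implementation. The function is_accepted is made up for this example. It assumes that entries without wildcard characters are compared as file name suffixes and entries with wildcards as patterns.

```python
from fnmatch import fnmatchcase


def is_accepted(filename, accept_list, ignore_case=False):
    """Return True if filename matches one entry of the accept list."""
    if ignore_case:
        # Compare everything in lower case, roughly what --ignore-case achieves.
        filename = filename.lower()
        accept_list = [entry.lower() for entry in accept_list]
    for entry in accept_list:
        if any(ch in entry for ch in "*?[]"):
            # Entry contains wildcards: treat it as a pattern.
            if fnmatchcase(filename, entry):
                return True
        elif filename.endswith(entry):
            # Plain entry: treat it as a suffix.
            return True
    return False


accept = [".jpg", ".png", ".gif"]
print(is_accepted("logo.png", accept))                      # True
print(is_accepted("PHOTO.JPG", accept))                     # False
print(is_accepted("PHOTO.JPG", accept, ignore_case=True))   # True
print(is_accepted("index.html", accept))                    # False
```

The second and third calls show why --ignore-case helps on sites where file extensions are written in capitals.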



