The question asks about the syntax for doing this with wget, but since it is tagged php, this answer is based on that language. You can try the following:
1. Download the page content: with file_get_contents, cURL, or any other method you know.
2. Extract the links from the page: you can use preg_match or parse the HTML with DOMDocument.
3. Download the file from each URL: you can use file_put_contents, or cURL together with fopen to open the file for writing.

Step by step:
To download the page content, with cURL:
function obterPagina($url) {
    $curl = curl_init();
    curl_setopt($curl, CURLOPT_URL, $url);
    curl_setopt($curl, CURLOPT_RETURNTRANSFER, true); // Return the response instead of printing it
    $pagina = curl_exec($curl);
    curl_close($curl); // Free the cURL handle
    return $pagina;
}
Note: you can include more options depending on your needs; see the curl_setopt documentation for the full list.
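If you prefer the file_get_contents route mentioned in step 1, the same fetch can be sketched in a single call (the function name obterPaginaSimples is just an illustrative choice; for remote URLs it requires allow_url_fopen to be enabled, and it gives you less control over timeouts and headers than cURL):

```php
<?php
// Fetches the content at $url with file_get_contents instead of cURL.
// Returns the content as a string, or false on failure.
function obterPaginaSimples($url) {
    return @file_get_contents($url); // @ suppresses the warning when the URL fails
}
```

It also works for local paths, which makes it easy to test without a network connection.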
To extract the links from the page, with DOMDocument:
function obterLinks($url, $pagina, $extensoes = ['gif', 'jpg']) { // Accepted extensions
    $dom = new DOMDocument;
    $links = [];
    libxml_use_internal_errors(true); // Suppress warnings from malformed HTML
    if ($dom->loadHTML($pagina) !== false) {
        foreach ($dom->getElementsByTagName('a') as $link) { // Iterate over every "a" element
            $href = $link->getAttribute('href');
            $extensao = pathinfo($href, PATHINFO_EXTENSION);
            if (in_array($extensao, $extensoes)) {
                $links[] = $url . $href; // Assumes hrefs are relative to $url
            }
        }
        return $links;
    }
    return false;
}
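As an alternative, step 2 can also be done with preg_match, as mentioned above. A minimal sketch (the name obterLinksRegex is illustrative, and the regex only catches double-quoted href values; the DOMDocument approach is more robust for real-world HTML):

```php
<?php
// Extracts hrefs whose extension is in the accepted list, using a regex.
function obterLinksRegex($url, $pagina, $extensoes = ['gif', 'jpg']) {
    $links = [];
    if (preg_match_all('/href="([^"]+)"/i', $pagina, $matches)) {
        foreach ($matches[1] as $href) {
            $extensao = strtolower(pathinfo($href, PATHINFO_EXTENSION));
            if (in_array($extensao, $extensoes)) {
                $links[] = $url . $href; // Same concatenation as the DOM version
            }
        }
    }
    return $links;
}
```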
To download the file, also with cURL:
function baixarArquivo($url, $salvarComo, $timeout = 3600) {
    $curl = curl_init();
    $fp = fopen($salvarComo, 'w'); // Open the file for writing
    if (!$fp)
        return false;
    $opts = array(CURLOPT_URL     => $url,
                  CURLOPT_FILE    => $fp,
                  CURLOPT_TIMEOUT => $timeout); // Timeout, default is 1 hour
    curl_setopt_array($curl, $opts);
    $ret = curl_exec($curl);
    curl_close($curl);
    fclose($fp);
    return $ret !== false;
}
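The file_put_contents route mentioned in step 3 can be sketched even more briefly (the name baixarArquivoSimples is illustrative; unlike the cURL version there is no timeout control, and the whole file is held in memory before being written):

```php
<?php
// Downloads $url and writes it to $salvarComo.
// Returns true on success, false if the read or the write fails.
function baixarArquivoSimples($url, $salvarComo) {
    $dados = @file_get_contents($url); // @ suppresses the warning on failure
    if ($dados === false)
        return false;
    return file_put_contents($salvarComo, $dados) !== false;
}
```

For large files, prefer the cURL version above, which streams straight to disk.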
To use it all together:
$url = "http://www.tarararara.com/images/";
$pagina = obterPagina($url);
if ($pagina) {
    $links = obterLinks($url, $pagina);
    if ($links) {
        foreach ($links as $link) {
            var_dump( baixarArquivo($link, basename($link)) ); // Saves in the same folder as the script
        }
    }
}