2012-07-11 53 views
3

有人有一个如何从web GUI调用scrapyd的例子吗?从Web GUI调用scrapy

我想从Web GUI调用Scrapy来实现Web Crawler。

我发现这个例子,但不是为我工作:

http://mahmoud.abdel-fattah.net/2012/07/04/super-simple-and-basic-scrapyd-web-interface/ 

<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" 
"http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> 
<html xmlns="http://www.w3.org/1999/xhtml"> 
    <head> 
     <title>Simple scrapyd web manager</title> 
     <script src="https://ajax.googleapis.com/ajax/libs/jquery/1.7.2/jquery.min.js"></script> 
     <script type="text/javascript"> 

      var scrapyd_url = 'http://localhost:6800/'; 

      var project_name = 'bkam'; 

      var all_spiders = new Array(); 
      var running_spiders = new Array(); 
      var finished_spiders = new Array(); 

      $(document).ready(function() { 
       $(".start_spider").live('click', function() { 
        spider_name = $(this).attr('rel'); 
        $.post(scrapyd_url + 'schedule.json', { 
         project : project_name, 
         spider : spider_name 
        }); 

       }); 

       $(".stop_spider").live('click', function() { 
        job_id = $(this).attr('rel'); 
        $.post(scrapyd_url + 'cancel.json', { 
         project : project_name, 
         job : job_id 
        }); 

       }); 

       function listjobs() { 
        $.getJSON(scrapyd_url + 'listjobs.json?project=' + project_name, function(data) { 
         var finished_items = []; 
         $.each(data.finished, function(key, val) { 
          finished_spiders.push(val.spider); 
          finished_items.push('<li>' + val.spider + '&nbsp;&nbsp;<a href="' + scrapyd_url + 'logs/' + project_name + '/' + val.spider + '/' + val.id + '.log">log</a></li>'); 
         }); 
         $('#finished_spiders').append(finished_items.join('')); 

         if (data.running.length > 0) { 
          var running_items = []; 
          $.each(data.running, function(key, val) { 
           running_spiders.push(val.spider); 
           running_items.push('<li>' + val.spider + '&nbsp;&nbsp;<a href="' + scrapyd_url + 'logs/' + project_name + '/' + val.spider + '/' + val.id + '.log">log</a>&nbsp;&nbsp|&nbsp;&nbsp<a href="#" class="stop_spider" rel="' + val.id + '">Stop</a></li>'); 
          }); 
          $('#running_spiders').append(running_items.join('')); 
         } else { 
          $('#running_spiders').append('<li>No Running spiders</li>'); 
         } 

        }); 
       } 

       listjobs(); 

       // Get spiders list 
       $.getJSON(scrapyd_url + 'listspiders.json?project=' + project_name, function(data) { 
        var items = []; 
        //console.log(data); 
        $.each(data.spiders, function(key, val) { 
         items.push('<li id="' + val + '">' + val + '&nbsp;&nbsp;<a href="#" class="start_spider" rel="'+ val + '">Start</a></li>'); 
        }); 

        $('#available_spiders').append(items.join('')); 
       }); 

      }); 
     </script> 
    </head> 
    <body> 
     <a href="#" onClick="window.location.reload">refresh</a> 
     <br /> 
     List of running spiders 
     <ul id="running_spiders"></ul> 
     List of finished spiders (First is latest) 
     <ul id="finished_spiders"></ul> 
     List of available spiders 
     <ul id="available_spiders"></ul> 
    </body> 
</html> 

我想运行的蜘蛛从Web GUI中点击按钮。请帮我... :(

回答

0

从我所看到的,这个例子试图联系scrapyd,我认为你没有运行。

如果您确信它在运行,尝试在访问浏览器http://localhost:6800/ - 它应该返回给你一些东西

+0

是的scrapyd正在命令行上运行,但是如果我在浏览器中输入:http:// localhost:6800,请不要让我显示项目正在运行: – Ajacs1104 2012-07-11 20:36:09

+0

你的浏览器显示你何时访问'http://127.0.0.1:6800'? – warvariuc 2012-07-12 04:09:13