2017-05-06 118 views
0

我正在构建一个电影推荐器。我的推荐引擎是用Python编写的。我通过node.js(Express)从网站运行它。从node.js运行Python脚本时出现parseError

python代码本身的工作,这里是当我从控制台运行它的输出。它是利用计算大熊猫和numpy的返回与电影的标题和其相似的一个选择电影中的基质,而且我还打印打招呼:

Python command code

在我的网站我有以下身体HTML

<form class="test" method="post" action="/test"> 
    <input type="text" name="user[name]"> 
    <input class="button" type="submit" value="Submit"> 
</form> 

JS客户端

(function($) { 

    $(document).ready(function() { 
     var btn = $('.button'), 
      input = $('input'); 
     btn.on('click', function() { 
     e.preventDefault(); 
     }) 
    }) 
})(jQuery) 

JS服务器端,与快速

var express = require('express'); 
var app = express(); 
var path = require('path'); 
var bodyParser = require('body-parser'); 
var PythonShell = require('python-shell'); 

var options = { 
    mode: 'text', 
    pythonOptions: ['-u'], 
    scriptPath: "E:/Praca Magisterska/Python", 
}; 

app.use(express.static(path.join(__dirname, ''))); 
app.use(bodyParser.json()); 
app.use(bodyParser.urlencoded({ 
    extended: true 
})); 

app.get('/', function (req, res) { 
    res.sendFile(path.join(__dirname+'/index.html')); 
}) 

app.post('/test', function (req, res) { 
    console.log(req.body); 

    PythonShell.run('similarMovies.py', options, function (err, results) { 
    if (err) throw err; 
    // results is an array consisting of messages collected during execution 
    console.log('results: %j', results); 
    }); 

}) 

app.listen(3000, function() { 
    console.log('Example app listening on port 3000!'); 
}) 

那么,它是如何工作的。点击提交btn我正在发射我的node.js来运行一个python脚本,然后是console.log的结果。不幸的是,我收到错误,最后形象。

然而,当我不运行功能,而不是它,我写在我的Python刚刚结束:

print "hello" 
print 2 

代码的结果被解析不错。

Image of command an results

可能是什么问题? Erros,我得到除以零和其他功能内?但是,如果是,为什么后来当我直接从CMD运行它,它正在 - python similarMovies.py

这里是蟒蛇代码:

# -*- coding: utf-8 -*- 
import pandas as pd 
import numpy as np 

def showSimilarMovies(movieTitle, minRatings): 

     # import ratingów z pliku csv 
    rating_cols = ['user_id', 'movie_id', 'rating'] 
    rating = pd.read_csv('E:/Praca Magisterska/MovieLens Data/ratings.csv', names = rating_cols, usecols = range(3)) 

    # import filmów z pliku csv 
    movie_cols = ['movie_id', 'title'] 
    movie = pd.read_csv('E:/Praca Magisterska/MovieLens Data/movies.csv', names = movie_cols, usecols = range(2)) 

    # łączenie zaimportowanych ratingów oraz filmów, usuwanie pierwszego wiersza 
    ratings = pd.merge(movie, rating) 
    ratings = ratings.drop(ratings.index[[0]]) 
    # konwertowanie kolumn ze stringów na numeric 
    ratings['rating'] = pd.to_numeric(ratings['rating']) 
    ratings['user_id'] = pd.to_numeric(ratings['user_id']) 

    # tworzenie macierzy pokazująceje oceny filmów przez wszystkich użytkowników. 
    movieRatingsPivot = ratings.pivot_table(index=['user_id'], columns=['title'], values='rating') 

    # filtrowanie kolumny do obliczania filmów podobnych 
    starWarsRating = movieRatingsPivot[movieTitle] 

    # obliczanie korelacji danego filmu z każdym innym i wyrzucanie tych z którymi nic go nie łączy 
    similarMovies = movieRatingsPivot.corrwith(starWarsRating) 
    similarMovies = pd.DataFrame(similarMovies.dropna()) 

    # zmiana nazwy kolumny oraz sortowanie według rosnącej korelacji 
    similarMovies.columns = ['similarity'] 
    similarMovies.sort_values(by=['similarity'], ascending=False) 

    # tworzenie statystyk dla filmów, size to ilość ocen, a mean to średnia z ocen 
    # zgrupowane po tytułach 
    movieStats = ratings.groupby('title').agg({'rating': [np.size, np.mean]}) 

    # popularne filmy, które mają więcej niż 100 ocen 
    popularMovies = movieStats['rating']['size']>=minRatings 

    # sortowanie popularnych filmów od najwyższej średniej 
    movieStats[popularMovies].sort_values(by=[('rating', 'mean')], ascending=False) 

    # łączenie popularnych filmów z filmami podobnymi do filtrowanego filmu i ich sortowanie 
    moviesBySimilarity = movieStats[popularMovies].join(similarMovies) 
    x = moviesBySimilarity.sort_values(by='similarity', ascending=False) 
    k = x.drop(x.columns[[0, 1]], axis = 1) 
    k = k.drop(x.index[[0]]) 
    return k 

print "hello"  
print 2 
showSimilarMovies('Star Wars: Episode VI - Return of the Jedi (1983)', 300) 
+0

考虑将'showSimilarMovies'调用包含在'try ... except BaseException中作为e:with open(“error.txt”,“w”)作为f:f.write(repr(e))''。这个想法是在某处登录异常,所以你可以看到究竟是什么崩溃。 – drdaeman

+0

@drdaeman 哎,我想: '开放( “error.txt”, “W”)为f: 尝试: showSimilarMovies( '星战前传VI - 绝地归来(1983年)',300 ) (BaseException除外)e: f.write(repr(e))' 不幸的是,error.txt是空的。当我尝试你的版本时,它甚至没有创建一个error.txt – Pacxiu

+0

嘿,当我在函数showSimilarMovies中评论了一切 - 行'starWarsRating = movieRatingsPivot [movieTitle]'并将该变量作为函数的输出打印出来,一切正常。 因此,python Shell解析器由于来自numpy和pandas的警告而以某种方式结束脚本,这些警告显示在命令和结果的图像中,我如何忽略这些错误并告诉节点只是将脚本运行到最后? – Pacxiu

回答

1

我想通了,只是增加了两行Python文件的beggining忽略警告:

import warnings 

warnings.filterwarnings('ignore') 

现在我的输出是想要的。