我遇到了一个猪脚本的问题,我尝试了很多不同的方法。任何人都可以指出我究竟做错了什么?它应该是非常简单的,我试图在计算平均值后得到最大值。pigscript error not calculation max
a = LOAD 'default.books' using org.apache.hcatalog.pig.HCatLoader();
b = LOAD 'default.book_rating' using org.apache.hcatalog.pig.HCatLoader();
books_and_ratings = join a by isbn, b by isbn;
by_isbn = GROUP books_and_ratings BY (a::isbn);
DESCRIBE by_isbn;
average_book_rating = FOREACH by_isbn
GENERATE books_and_ratings.book_title, books_and_ratings.a::isbn as isbn1,
books_and_ratings.book_author, books_and_ratings.publisher,
AVG(books_and_ratings.book_rating) as AVG_RATING;
DESCRIBE average_book_rating;
group_avg = GROUP average_book_rating ALL;
DESCRIBE group_avg;
max_avg_rating = FOREACH group_avg
GENERATE FLATTEN average_book_rating.a::book_title, isbn1,
average_book_rating.a::book_author, average_book_rating.a::publisher, MAX(AVG_RATING);
dump max_avg_rating;
解析失败:不匹配的输入 'average_book_rating' 期待LEFT_PAREN
您是否收到错误,或者只是没有正确计算最大值? – Eyal 2014-09-28 13:56:13
@eyal实际上得到一个错误.... – Hades 2014-09-28 20:24:43
计算max_avg_rating的最后一个stmt不正确。你能粘贴确切的错误吗? – 2014-09-29 00:48:24