Here's the code:
from lxml.html import parse

__author__ = 'Psycho_Coder'


def main():
    with open("mangalist.csv", "w") as f:
        tree = parse("http://www.mangapanda.com/alphabetical")
        manga_name_list = tree.xpath("//ul[@class='series_alpha']/li/a/text()")
        manga_url_list = tree.xpath("//ul[@class='series_alpha']/li/a/@href")
        f.write("\"Manga Name\", URL\n")
        for i in range(len(manga_name_list)):
            f.write("\"{0}\", http://www.mangapanda.com{1}\n".format(manga_name_list[i].replace("\"", ""), manga_url_list[i]))


if __name__ == "__main__":
    main()
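
The script above builds each CSV line by hand and drops any double quotes from the manga names so the row doesn't break. As an alternative, here is a minimal sketch that lets Python's standard `csv` module handle the quoting instead, so names containing quotes or commas are kept intact. The helper name `write_manga_csv` and the sample names and paths are made up for illustration; they are not part of the original script.

```python
import csv
import io

BASE_URL = "http://www.mangapanda.com"  # same host the script above prepends


def write_manga_csv(names, hrefs, out):
    """Write (name, absolute URL) rows to the file-like object `out`.

    Hypothetical helper: `names` and `hrefs` stand in for the two
    lists returned by the XPath queries in the script above.
    """
    writer = csv.writer(out, quoting=csv.QUOTE_MINIMAL, lineterminator="\n")
    writer.writerow(["Manga Name", "URL"])
    for name, href in zip(names, hrefs):
        # csv.writer quotes the field and doubles embedded quotes when needed
        writer.writerow([name.strip(), BASE_URL + href])


# Made-up sample data, written to an in-memory buffer instead of a file
buf = io.StringIO()
write_manga_csv(
    ['Say "I Love You"', "Naruto, Part 2"],
    ["/say-i-love-you", "/naruto"],
    buf,
)
print(buf.getvalue())
```

To write a real file you would pass an object opened with `open("mangalist.csv", "w", newline="")` in place of the buffer. Using `zip` also avoids indexing both lists by position.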
CSV: https://github.com/AnimeshShaw/MangaScrapper/blob/master/resources/mangalist.csv
Code on GitHub: https://github.com/AnimeshShaw/MangaScrapper/blob/master/resources/MangaList.py
I hope you liked this script. Please share this post on your social networks :)





