|  | 
| Snapshot of a text file saved under the comma-separated value format (Photo credit: Wikipedia) | 
Here's the code :-
from lxml.html import parse
__author__ = 'Psycho_Coder'
def main():
    with open("mangalist.csv", "w") as f:
        tree = parse("http://www.mangapanda.com/alphabetical")
        manga_name_list = tree.xpath("//ul[@class='series_alpha']/li/a/text()")
        manga_url_list = tree.xpath("//ul[@class='series_alpha']/li/a/@href")
        f.write("\"Manga Name\", URL\n")
        for i in range(len(manga_name_list)):
            f.write("\"{0}\", http://www.mangapanda.com{1}\n".format(manga_name_list[i].replace("\"", ""), manga_url_list[i]))
if __name__ == "__main__":
    main() 
CSV : https://github.com/AnimeshShaw/MangaScrapper/blob/master/resources/mangalist.csv
Code On Github https://github.com/AnimeshShaw/MangaScrapper/blob/master/resources/MangaList.py
I hope you liked this script. Please share this post in your social network :)







 
 
0 comments :
Post a Comment