• Home
  • About
  • Archives
  • License
  • Contact

Webtropic

The digital lair of Nikos Anagnostou

Feeds:
Posts
Comments
« I call it “Relief”
A Calibre plugin for Greek book metadata »

Bookmeta: a book metadata extractor for Greek books

January 15, 2012 by Nikos Anagnostou

You know how it is: you want something that no program/service provides and you end up making it yourself.

In my case, what I wanted was a metadata source for my books in calibre.

I use calibre not only for tracking ebooks (which is what is made for), but also for normal, printed books.

I have no problem with the English and French books of my library since the metadata sources provided by calibre are more than enough.

But I also own a big number of Greek books and, there, the results I get through calibre are quite poor: for very few greek books there are metadata available(provided by Google). And these are transcribed into Latin, which is odd to read.

Now, the most comprehensive source of greek books, especially the last 20 years production, is found in the Biblionet database. But there is no web api and the  semantics of their web pages are poor if existent at all. Which means that in order to get book metadata, one has to scrape their web pages.

The simplest query (and the most reliable) to perform in biblionet is to search by isbn, as this returns only one  record (or none, if the isbn is not found).

So what i did, was to make a simple web app(?) that searches a book by isbn in biblionet, parses the result page and returns a json or html output. I call it bookmeta.

Here is a sample output:

{"biblionetid" : "44201", "cover_url" : "http://biblionet.gr/images/logo_gr.jpg", "title" : "Από το Βυζάντιο στην Αναγέννηση", "authors" : "N. G. Wilson", "translators" : "Φωτεινή Πρεβεδούρου - Γεωργίνη", "publisher" : "Εκδοτικός Οίκος Α. Α. Λιβάνη", "yr_published" : "1994.", "original_language" : " αγγλικά", "original_title" : " From Byzantium to Italy", "categories" : "Ευρώπη - Ιστορία - Αναγέννηση [DDC: 940.21]"}

You can find the code at github.

This script  isn’t meant for end users. It is for people who might want to use this in another product or service. Help yourselves folks!

Next steps?

I am now developing a calibre plugin that does exactly the same thing, and since my python knowledge is close to zero, I am going to use this hack to get book metadata in the plugin almost without  any further parsing in python (for v.1 at least).

Share this:

Like this:

Like
Be the first to like this post.

Posted in Code | Tagged biblionet, books, ebooks, greek, greek books, json, metadata sources | 1 Comment

One Response

  1. on January 19, 2012 at 8:06 pm A Calibre plugin for Greek book metadata « Webtropic

    [...] Comments « Bookmeta: a book metadata extractor for Greek books [...]



Comments are closed.

  • TwitterCounter for @nikan
  • Enter your email address to follow this blog and receive notifications of new posts by email.

    Join 7 other followers

  • Recent Comments

    Iosif Kanakaris on Responsive Images: the solutio…
    Nikos Anagnostou on Responsive Images: the solutio…
    Nikos Anagnostou on Responsive Images: the solutio…
    Iosif Kanakaris on Responsive Images: the solutio…
    Nikos Anagnostou on A Calibre plugin for Greek boo…
  • Recent Posts

    • Snap Snare by Snare Complex
    • Responsive Images: the solutions so far and a mixed new one
    • A Calibre plugin for Greek book metadata
    • Bookmeta: a book metadata extractor for Greek books
    • I call it “Relief”
  • Categories

    • Business (21)
      • Events (8)
      • Startups (7)
    • Ideas (7)
    • Miscellaneous (25)
      • liteblogging (3)
      • People (3)
      • Personal (4)
    • Social media (22)
    • Technology (44)
      • Code (6)
      • HowTo (7)
      • Tech News (11)
    • Web Design (1)
  • My Tweets

    • “Steve wore the same thing every day because he didn’t want to think about what he was wearing.” ht.ly/bgcaW:::3 days ago
    • Facebook (FB) Pushes Opera Shares To All-time High | ValueWalk ht.ly/be1yA:::4 days ago
    • Snap Snare by Snare Complex bit.ly/KBzG61:::1 week ago
  • premium wordpress themes
  • RSS Scicada

    • What is so novel about Pinterest?
    • Discussing the Social Organization
    • Next11 – Berlin 17/18 of May 2011
    • IOC10 – Our presentation about Social Shopping (Greek)
    • From Group buying to Group gifting
  • RSS GreekStartups.com

    • Startup 2.0 15-16/6 in Bilbao: Game developers heads up
    • vss feed
    • ErgoQ
    • Made in Greece
    • weendy

Blog at WordPress.com.

Theme: MistyLook by Sadish.


Follow

Get every new post delivered to your Inbox.

Powered by WordPress.com
loading Cancel
Post was not sent - check your email addresses!
Email check failed, please try again
Sorry, your blog cannot share posts by email.