Please read. Significant change on the site that will affect compatibility [ Dismiss ]
Home ยป Forum ยป Author Hangout

Forum: Author Hangout

Conversion software for epub

StarFleet Carl ๐Ÿšซ

Question -

When you're taking your completed html file and converting it into an epub for Bookapy, what software do you use?

It's easy to create a pdf file. Keep in mind that since I've never actually done this before (with Book One, since it was complete on SOL, I simply copied the epub file from the original site) - I've created a file and uploaded it. I'm hearing that it's not created the chapters, and since I've had to post a couple of changes (justifiably, based upon the criticism I got from Lazeez as the publisher), I'm screwed up a little.

I have .odt and .html files on my computer, but so far I've just used an epub converter I found online. Is there a certain or better one that you all use?

The Outsider ๐Ÿšซ
Updated:

@StarFleet Carl

If you have the .html, you could build the .ePub yourself, which is what I did for Skyscrapers; I used Sigil for Mac, but they have a Windows version also. It's not hard to build a standard shell for one then reuse it again.

It's not automatic, though. You'd have to build a table of contents and a .CSS file, but you also don't have to go crazy.

Switch Blayde ๐Ÿšซ
Updated:

@StarFleet Carl

I use Calibre.

My input is a docx file and I do three Calibre conversions from it for Bookapy: epub, mobi, PDF. I'm sure Calibre accepts ODT files as input as well.

If you define the chapter headings as H1 Calibre will create the Table of Contents.

I shouldn't have to put a page break before each chapter heading, but I do. I've had chapters in the epub that didn't start on a new page so I put the page break before them all.

Replies:   Crumbly Writer
Crumbly Writer ๐Ÿšซ

@Switch Blayde

I shouldn't have to put a page break before each chapter heading, but I do. I've had chapters in the epub that didn't start on a new page so I put the page break before them all.

Under the html settings section, you need to define the page-break for the h1 and/or h2 commands using the following:
< head>
< style>

< head>
< style>
h1
{
page-break-before:always;
}
< /style>
< /head>

You can also center, increase the space and below or make it bold or italicized via the same html Style definitions.

Replies:   Switch Blayde
Switch Blayde ๐Ÿšซ

@Crumbly Writer

Under the html settings section, you need to define the page-break for the h1

I do the formatting in Word so I use "Heading 1" to format the chapter headings. Calibre converts that to the appropriate HTML.

Replies:   Crumbly Writer
Crumbly Writer ๐Ÿšซ

@Switch Blayde

I do the formatting in Word so I use "Heading 1" to format the chapter headings. Calibre converts that to the appropriate HTML.

That's understood, but there's no clear cut way of forcing < H1> headers to create a page break other than manually inserting the page breaks, which don't work in html (i.e. ePubs). My suggestion was how to add it back.

Replies:   Switch Blayde
Switch Blayde ๐Ÿšซ

@Crumbly Writer

but there's no clear cut way of forcing < H1> headers to create a page break

Calibre must recognize Word's page break and does the magic in HTML.

Crumbly Writer ๐Ÿšซ

@Switch Blayde

Calibre must recognize Word's page break and does the magic in HTML.

Sorry, but I've checked several Calibre produced ePubs, and they don't carry over the manual page breaks from WORD (which is why I properly coded the CSS entry in my ePubs).

Replies:   Ernest Bywater
Ernest Bywater ๐Ÿšซ

@Crumbly Writer

Sorry, but I've checked several Calibre produced ePubs, and they don't carry over the manual page breaks from WORD (which is why I properly coded the CSS entry in my ePubs).

That's an issue with the way MS Word codes the format data, not with how Calibre handles it.

As to the page break within an epub within Calibre, the default is to have a page break before a H1 so the headings work in the html.

Mind you Libre Office now has a one-click process to create epubs, and it breaks up the html code based on the H1 Heading style.

Switch Blayde ๐Ÿšซ

@Ernest Bywater

As to the page break within an epub within Calibre, the default is to have a page break before a H1 so the headings work in the html.

Except every once in a while the epub generated from Calibre did not have a page break before a chapter heading which was defined as "heading 1" (H1). But when I put a Word page break before the chapter headings I never have the problem.

Ernest Bywater ๐Ÿšซ

@Switch Blayde

Except every once in a while the epub generated from Calibre did not have a page break before a chapter heading which was defined as "heading 1" (H1). But when I put a Word page break before the chapter headings I never have the problem.

I never had that problem happen to me.

Replies:   Switch Blayde
Switch Blayde ๐Ÿšซ

@Ernest Bywater

I never had that problem happen to me.

It's happened to me several times when using the Calibre e-book Viewer to check out the results of the epub created. It's never happened since I put the page breaks before the chapter headings.

I was afraid I would end up with a blank page before each chapter (two page breaks) but that never happened.

Replies:   Crumbly Writer
Crumbly Writer ๐Ÿšซ

@Switch Blayde

It's happened to me several times when using the Calibre e-book Viewer to check out the results of the epub created. It's never happened since I put the page breaks before the chapter headings.

Be careful using the Calibre default ePub viewer, as it doesn't always reflect the coding within the ePub, as it's Calibre's own rendition engine, rather than a true ePub viewer. I typically default to that view as I usually search for specific data (i.e. which version of the ePub did I just install (ex: Amazon, Smashwords or Bookapy), and it's easier to navigate.

Crumbly Writer ๐Ÿšซ

@Switch Blayde

But when I put a Word page break before the chapter headings I never have the problem.

Have you checked your ePub code? (i.e. does Calibre transfer the manual page break into the eBook, or does it merely trigger the default < h1> behavior?) I was always under the impression that a Word based page break isn't transferred, since it's not part of a standard style definition (i.e. it's simply an 'in-line' format, rather than a persevered CSS style definition).

Replies:   Switch Blayde
Switch Blayde ๐Ÿšซ

@Crumbly Writer

Have you checked your ePub code?

To do that is way above my pay grade.

Crumbly Writer ๐Ÿšซ

@Ernest Bywater

That's an issue with the way MS Word codes the format data, not with how Calibre handles it.

Sorry, but I code ALL of my ePubs by hand (i.e. I convert the text to html via M$ Word, but then swap out the code to the 'cleaned up' version I typically use with each of my books.

Anyway, if anyone is having trouble getting their ePubs to page break for each section, the code I showed IS the way to get it to process it correctly. But, if you're not having a problem, than just quit kvetching already!

Replies:   Ernest Bywater
Ernest Bywater ๐Ÿšซ

@Crumbly Writer

Sorry, but I code ALL of my ePubs by hand

CW, please make up your mind as to what you're doing and complaining about!

Before you said

Sorry, but I've checked several Calibre produced ePubs, and they don't carry over the manual page breaks from WORD

which I said would be a MS Word code issue, not a Calibre issue. Now you talk about using html. So why did you mention the MS Word page break to start with if you aren't using it?

Replies:   bk69
bk69 ๐Ÿšซ

@Ernest Bywater

He claimed to use Word to produce the HTML, from which he'd strip out some of the worst crap in the header. Presumably he'd leave the inline commands and insert a few different definitions.

Ernest Bywater ๐Ÿšซ

@Switch Blayde

Calibre must recognize Word's page break and does the magic in HTML.

Actually, I think you'll find it recognises the Heading 1 and applies that then breaks up the html as per the H1 code.

Michael Loucks ๐Ÿšซ

I use Scrivener and my original input is text files. Scrivener generates clan Kindle, ePub, and PDF files.

Ernest Bywater ๐Ÿšซ

@StarFleet Carl

Libre Office now has the ability to make an epub from your .odt file. However, I create a html file for web use and have found the best result for an epub is to have a good css style sheet code in the html and use that to make the epub by using Calibre.

StarFleet Carl ๐Ÿšซ

Thank you all for your help.

I was up way too damn late last night getting things to work properly. I think the final package is worth it.

I'm completely amused, by the way, from how this has gone in one year from, gee, I'm posting something to help pay back and get people to the website and maybe make a little money for a pizza, to I've sold HOW many copies, and crap, this isn't just simple stuff anymore, is it?

Replies:   Crumbly Writer
Crumbly Writer ๐Ÿšซ

@StarFleet Carl

I'm completely amused, by the way, from how this has gone in one year from, gee, I'm posting something to help pay back and get people to the website and maybe make a little money for a pizza, to I've sold HOW many copies, and crap, this isn't just simple stuff anymore, is it?

As always, the more you learn, the more you realize how little you know. And with something like Independent Publishing, where you aren't just learning how to write, but how to format documents, publish, create website and market you're own works, you're always behind the 8-ball, no matter how much experience you eventually build up. ;)

Replies:   StarFleet Carl
StarFleet Carl ๐Ÿšซ

@Crumbly Writer

As always, the more you learn, the more you realize how little you know.

I'm not saying this to be taken at anything other than face value.

I was writing these stories because I've always wanted to be author, and finding SOL allows me to not only write material, but to post it so that other people can either enjoy it or not at their own discretion is almost like my childhood dream finally coming true.

The minor detail that I've had people on here that like my work so much that they're willing to pay for something I'm posting for free is like the ultimate compliment.

Switch Blayde ๐Ÿšซ
Updated:

@StarFleet Carl

I just happened to stumble upon this in Calibre documentation:

What are the best source formats to convert?

In order of decreasing preference: LIT, MOBI, AZW, EPUB, AZW3, FB2, FBZ, DOCX, HTML, PRC, ODT, RTF, PDB, TXT, PDF

I assume "order of decreasing preference" means the first (LIT) is the best and the last (PDF) is the worst.

So it prefers DOCX over ODT. And surprisingly, DOCX is more preferred than HTML. I'm also surprised that MOBI is more preferred than EPUB.

Ernest Bywater ๐Ÿšซ

@Switch Blayde

So it prefers DOCX over ODT. And surprisingly, DOCX is more preferred than HTML. I'm also surprised that MOBI is more preferred than EPUB.

My understanding from comments seen about Calibre in the past, is that is a 'user' preference list in that it shows the order of frequency of the users. I always took it to mean that the most frequently converted files were Lit being converted to something else or converted from Lit, then so on down the list.

Replies:   Crumbly Writer
Crumbly Writer ๐Ÿšซ

@Ernest Bywater

I always took it to mean that the most frequently converted files were Lit being converted to something else or converted from Lit, then so on down the list.

That makes sense, as almost no browser software continues to support the older .LIT format, so users who want to preserve older text need to convert it to another, supported format.

Ernest Bywater ๐Ÿšซ

@Switch Blayde

I just happened to stumble upon this in Calibre documentation:

Switch, I forgot to mention in the earlier post, what you have to remember is that Calibre is not a tool for creating e-pubs from scratch, although we use it as such. Calibre is a tool for converting e-books to other formats so people can read them on different readers to the readers they first purchased the e-books for. Thus the creators of Calibre have always had, as their primary focus, conversion from one e-book format to another and their coding reflects that aim. The later input types have been added as people have requested them. When I first started using Calibre the list of input options was only half of what it is now.

Dominions Son ๐Ÿšซ

@Switch Blayde

I'm also surprised that MOBI is more preferred than EPUB.

MOBI is an open format that is NOT proprietary to Amazon.

AZW & AZW3 are basically MOBI + Amazon DRM. There may be other extensions, but adding DRM was the driver behind creating AZW

Replies:   Crumbly Writer
Crumbly Writer ๐Ÿšซ

@Dominions Son

MOBI is an open format that is NOT proprietary to Amazon.

Maybe not, but it's almost used exclusively by Amazon users/customers (i.e. those who purchase their books from Amazon and then display it using ONLY the Amazon viewer. The main advantage of ePubs is that, since the rendering is based on html, the format is display correctly by each device type, since it defaults to the local browser's html engine to display the contents. Thus, the ePub is more 'universal' in its application, rather than in how it's used by most people.

But, since most people purchase their books via Kindle, that's the most frequently used format, as its the 800-pound gorilla in the room.

Replies:   Dominions Son
Dominions Son ๐Ÿšซ

@Crumbly Writer

Maybe not, but it's almost used exclusively by Amazon users/customers (i.e. those who purchase their books from Amazon and then display it using ONLY the Amazon viewer.

As I understand it, MOBI is a very early e-book format. It predates e-pub as well as Amazon getting into e-books.

However, it wasn't very popular before Amazon selected it as the base for the AZW format on Kindle. Kindle can handle a raw mobi, but anything you get from Amazon will be AZW or AZW3.

Replies:   Switch Blayde
Switch Blayde ๐Ÿšซ

@Dominions Son

MOBI is a very early e-book format. It predates e-pub

MOBI was created in 2000 by Mobipocket (Amazon purchased Mobipocket in 2005, Kindle 1 was released in 2007). EPUB was created ~2007 by the IDPF (International Digital Publishing Forum).

Crumbly Writer ๐Ÿšซ

@Switch Blayde

I assume "order of decreasing preference" means the first (LIT) is the best and the last (PDF) is the worst.

Unless it's been dramatically revised in the past five years, LIT is little more than a basic text format with a few in-line formatting options. And pdf would, by definition, be the worst, since it's coded for a specific fixed format output (i.e. not the standard definition of a free-flowing eBook that automatically adapts to ANY device type).

Replies:   Switch Blayde
Switch Blayde ๐Ÿšซ
Updated:

@Crumbly Writer

This is where Chapters and Page Breaks is discussed in Calibre's documentation: https://manual.calibre-ebook.com/conversion.html

calibre has two sets of options for chapter detection and inserting page breaks. This can sometimes be slightly confusing, as by default, calibre will insert page breaks before detected chapters as well as the locations detected by the page breaks option.

So if it detects a chapter heading it should do the page break. But I've had situations where it didn't (I don't have the specifics).

And then it says:

By default, calibre uses the following expression for chapter detection:

//*[((name()='h1' or name()='h2') and re:test(., 'chapter|book|section|parts+', 'i')) or @class = 'chapter']

This expression is rather complex, because it tries to handle a number of common cases simultaneously. What it means is that calibre will assume chapters start at either < h1> or < h2> tags that have any of the words (chapter, book, section or part) in them or that have the class="chapter" attribute.

My chapter names start with "Chapter" (e.g., Chapter 1) so it finds the word "chapter." I remember one situation where it didn't do the page break (and maybe it was the only situation) when my chapter name was "Preview - Chapter 1" (it was some preview chapters of a different novel at the end of the current novel). It appeared in the Table of Contents but did not have a page break before it. So maybe it was that it started with the word "Preview" rather than "Chapter."

Ernest Bywater ๐Ÿšซ

In checking the latest Calibre documentation, as there is an update as of 30 Oct 2020 which I just upgraded to, it states:

The EPUB I produced with calibre is not valid?

calibre does not guarantee that an EPUB produced by it is valid. The only guarantee it makes is that if you feed it valid XHTML 1.1 + CSS 2.1 it will output a valid EPUB. calibre tries hard to ensure that EPUBs it produces actually work as intended on a wide variety of devices, a goal that is incompatible with producing valid EPUBs, and one that is far more important to the vast majority of its users. If you need a tool that always produces valid EPUBs, calibre is not for you. This means, that if you want to send a calibre produced EPUB to an online store that uses an EPUB validity checker, you have to make sure that the EPUB is valid yourself, calibre will not do it for you โ€“ in other words you must feed calibre valid XHTML + CSS as the input documents.

I can attest to this as I validate my epubs with

http://validator.idpf.org/

While the Calibre epub has opened and been read in the two main epub readers I use on my computer the validator frequently rejected the epub for a a number of reasons. because of that I took to validating the html code prior to making the epub via

https://validator.w3.org/#validate_by_upload+with_options

In checking the epubs that failed validation as an epub I found the HTML code failed validation as well due to some faulty code, yet the html still opened and appeared correctly - most of the time. The most common problem was an incorrect or failed to include opening or closing statement in the html code which the browser simply ignored but the validator found - I suspect the same happened within Calibre and the epub then failed validation due to the faulty code.

Since instituted checking the html code through the validator then correcting any errors found, every epub has passed validation.

Replies:   Switch Blayde
Switch Blayde ๐Ÿšซ

@Ernest Bywater

http://validator.idpf.org/

Thanks for the link. Although it says it's in Beta.

All my epubs validated. They were created with Calibre using docx files as input.

Back to Top

Close
 

WARNING! ADULT CONTENT...

Storiesonline is for adult entertainment only. By accessing this site you declare that you are of legal age and that you agree with our Terms of Service and Privacy Policy.


Log In