Home ยป Forum ยป Author Hangout

Forum: Author Hangout

New Document type allowed for submissions (experimental)

Lazeez Jiddan (Webmaster)
Updated:

I modified the submission system to allow attachment of .docx files.

Currently the support is experimental to see if it's truly feasible.

If you decide to submit in .docx, please check the results after the posting carefully to see if the text had been extracted correctly. Currently supported formatting: bold, italics, superscript, subscript, headers, centring and right alignment. Blockquote isn't supported because the format doesn't really have it, the rest is like submissions in html.

Please report any problems.

Edits: typos

Replies:   Switch Blayde  oldegrump
Banadin ๐Ÿšซ

I thought all my SOL submissions were in docx. Never had any problems.

Ed

Lazeez Jiddan (Webmaster)

@Banadin

Until today, if you had attached a .docx file, it would have been rejected.

Switch Blayde ๐Ÿšซ

@Lazeez Jiddan (Webmaster)

Not paragraph indenting?

If not, can you put in a double line space when you encounter indenting to separate paragraphs with a blank line when the indenting is stripped?

Lazeez Jiddan (Webmaster)

@Switch Blayde

The paragraph indenting (first line) on SOL is not an author-set style. On the site here first paragraph indent is under the reader's control. You can set your style to read stories with the first line indented as much as you want. All readers can, but as an author you can't force it.

For ebooks, most e-readers override the built in stylesheet for ebooks. I for one make sure that I'm reading in my own font, no indent, no justification. I hate justified text on a small screen and I do most of my reading on my phone, and now most people do.

Switch Blayde ๐Ÿšซ

@Lazeez Jiddan (Webmaster)

I hate justified text on a small screen and I do most of my reading on my phone, and now most people do.

I switched to justified text only because that's how the traditionally published novels are.

It makes sense for print since the margins are uniform on the printed page, but I never thought to do it for an ebook until I checked out the traditionally published novels.

Vincent Berg ๐Ÿšซ

@Lazeez Jiddan (Webmaster)

The paragraph indenting (first line) on SOL is not an author-set style. On the site here first paragraph indent is under the reader's control. You can set your style to read stories with the first line indented as much as you want. All readers can, but as an author you can't force it.

Except, that's a literary strategy employed by many, including several SOL authors. It's NOT a 'first line' indent, as typically the first line is NOT indented. Instead, it's to designate information from an alternate source (ex: a handwritten note, a posted sign, or broadcast speech like a radio program). In other words, it's dialogue from someone who's NOT present in the story, is treated as 'quoted text', and is set apart so readers DON'T confuse it for dialogue from the regular cast members in the scene.

It's a handy visual shorthand that, while typically not explained in detail, is quickly absorbed by readers.

As Switch suggested, I typically include both indented text AND separate it with extra blank lines before and after the associated dialogue. That deftly avoids you're cutting the formatting.

Switch Blayde ๐Ÿšซ

@Vincent Berg

It's NOT a 'first line' indent, as typically the first line is NOT indented. Instead, it's to designate information from an alternate source

Do you mean blockquote? I believe SOL supports blockquote. Although I don't know if that will work if the file was a docx.

Switch Blayde ๐Ÿšซ

@Vincent Berg

As Switch suggested, I typically include both indented text AND separate it with extra blank lines before and after the associated dialogue.

That's not what I was talking about.

When I write a story to publish as an ebook, I indent the paragraphs except the first in a chapter or scene change. There are no blank lines between paragraphs.

When I write a story to post on SOL, I don't indent the paragraphs but put a blank line between the paragraphs.

I was wondering if I could write all my stories the same way (indenting with no blank line) and then, if it's a docx file, have SOL convert that to no indenting and add a blank line between paragraphs.

Replies:   Keet
Keet ๐Ÿšซ

@Switch Blayde

When I write a story to publish as an ebook, I indent the paragraphs except the first in a chapter or scene change. There are no blank lines between paragraphs.

When I write a story to post on SOL, I don't indent the paragraphs but put a blank line between the paragraphs.

That is perfectly fine. There's a difference between an ebook and a story on SOL. An ebook is a single instance of a story and all styling affects only that single story where on SOL that same story is part of a set of thousands of stories. An ebook is usually formatted following general publishing rules where on SOL the reader controls how a story is displayed. If Lazeez converted text-indent into the html pages the reader would see every story with different styling and can no longer set this with his preferences.
By-the-way: on SOL there are no blank lines between paragraphs but paragraphs are styled to show a top margin. When you put in a blank line it just marks a new paragraph for the converter.

Replies:   Switch Blayde
Switch Blayde ๐Ÿšซ

@Keet

By-the-way: on SOL there are no blank lines between paragraphs but paragraphs are styled to show a top margin. When you put in a blank line it just marks a new paragraph for the converter.

I didn't know that.

Then if the docx paragraph is indented, why can't the SOL converter just mark a new paragraph for the converter like it does when it encounters a blank line?

What I don't want is, if I indent paragraphs without the blank line between them, for it to end up as one huge paragraph. That's why I put in blank lines when submitting to the SOL Wizard.

Replies:   Keet
Keet ๐Ÿšซ
Updated:

@Switch Blayde

What I don't want is, if I indent paragraphs without the blank line between them, for it to end up as one huge paragraph. That's why I put in blank lines when submitting to the SOL Wizard.

The last line of what you quoted from my post: "When you put in a blank line it just marks a new paragraph for the converter." That's how I understood the converter recognizes a new paragraph, not by an indent. Someone correct me if I'm wrong, SOL is currently too slow to look up the exact rules.
Maybe Lazeez will make a change for the docx format where he also accepts a paragraph indent as a marker for a new paragraph. That would eliminate the need for an extra blank line and leave the docx the same for a conversion to an ebook.

ETA: to be clear, the extra line in the source document that is uploaded to SOL needs the blank line as a paragraph marker but the resulting html page on SOL doesn't have that line. It goes from < p > .... < /p > to the next < p > ... < /p >. You can see that if you right click on the page and look at the page source. There is a blank line between the p's but those are not processed by the browser engine. A 'real' blank line would be a < br > between the p's if it was converted and < br >'s are to be avoided as much as possible in html code because there's only limited support for styling it in CSS and not always consistent between browsers.

Switch Blayde ๐Ÿšซ

@Keet

Maybe Lazeez will make a change for the docx format where he also accepts a paragraph indent as a marker for a new paragraph. That would eliminate the need for an extra blank line and leave the docx the same for a conversion to an ebook.

That's exactly what I'm asking for.

Replies:   Keet
Keet ๐Ÿšซ

@Switch Blayde

That's exactly what I'm asking for.

Then you better make sure you don't miss the indention on a paragraph or the converter will not recognize a new paragraph (unless of course you add a blank line).

Replies:   Switch Blayde
Switch Blayde ๐Ÿšซ

@Keet

Then you better make sure you don't miss the indention on a paragraph

Word does it automatically when I hit the return (assuming the paragraph I'm on has the indent).

I don't indent the first paragraph of a chapter or new scene. But they're proceeded by the chapter name (which has a blank line after it) or the centered * * * * (which has a blank line before and after it).

Replies:   Vincent Berg
Vincent Berg ๐Ÿšซ

@Switch Blayde

I don't indent the first paragraph of a chapter or new scene. But they're proceeded by the chapter name (which has a blank line after it) or the centered * * * * (which has a blank line before and after it).

Again, using SOL rules, EVERY paragraph has an header between paragraphs (bland line & the next paragraph). Posting .docx should use your own guidelines, but SOL may force their traditional paragraph rules on EVERY line, regardless of how it's formatted.

Lazeez Jiddan (Webmaster)

@Vincent Berg

docx support on SOL is almost exactly like html support. Actually it's missing few tags. Like docx doesn't have a direct equivalent to {block}.

For the beta site's sample viewer, it's simply using your stylesheet that you've embedded in your epub. It's not converting anything at all. It's simply pulling few chapters of your EPUB and displaying them as is.

Future docx support on the beta site will not even allow images embedded in text (it's a horrible job importing images from docx). It will simply do mostly text conversion, create a TOC and embed the cover that you submit to create an EPUB file.

Replies:   Vincent Berg
Vincent Berg ๐Ÿšซ

@Lazeez Jiddan (Webmaster)

Future docx support on the beta site will not even allow images embedded in text (it's a horrible job importing images from docx). It will simply do mostly text conversion, create a TOC and embed the cover that you submit to create an EPUB file.

That makes sense. One benefit of ePubs is that the included images are composed of embedded zip files, which means repeated images (like graphic section breaks) aren't duplicated, making the images much smaller, plus it doesn't take extra processing by the author (Calibre packages the included images for you).

But if the SOL .docx processing doesn't include the document's formatting (inline instead of style based formatting), then there's really NO use in using it, it's just a more convoluted way of submitting html! (I just tried submitting a couple .docx single chapters, just to see how they processed.)

Lazeez Jiddan (Webmaster)

@Vincent Berg

But if the SOL .docx processing doesn't include the document's formatting (inline instead of style based formatting), then there's really NO use in using it, it's just a more convoluted way of submitting html!

It's the equivalent of html (surprisingly with less features). But I implemented it since it's basically free code carried over from the beta site, and it makes it much easier for inexperienced writers to submit their text to sol. It would amaze you how difficult some find converting their Word documents to html.

Replies:   Vincent Berg
Vincent Berg ๐Ÿšซ

@Lazeez Jiddan (Webmaster)

It would amaze you how difficult some find converting their Word documents to html.

Trust me, considering the amount of time I invest regularly cleaning up Word's html code, I can easily believe it. What's more, you seamless processing of my clean epub works MUCH better than ANY site's conversion process!

Keet ๐Ÿšซ

@Vincent Berg

Posting .docx should use your own guidelines

NO! One of the great things on SOL is that every story is represented in the same style as set by the user. You would ruin that with respecting the docx style. Individual styles are for print/ebooks, not for on-line reading where the user expects a consistent layout. Even e-readers allow you to overrule some of the set layout elements like font, line-height, etc. Just like SOL allows.

Replies:   REP
REP ๐Ÿšซ

@Keet

I would agree. I suspect that Lazeez will maintain the look and feel of the current format. It appears to me that Lazeez has been saying, he will convert the content format defined by the poster's .docx files to the format used by SOL. He also said, only certain .docx codes will be converted. I don't recall him saying the converted .docx file content would have the same appearance when it is posted.

Switch Blayde ๐Ÿšซ

@REP

he will convert the content format defined by the poster's .docx files to the format used by SOL

Again, that's what I expect, too. But what happens when the docx file indents paragraphs and does not have a blank line between those paragraphs?

The SOL format is no indent with blank lines between paragraphs. That's fine. I like it like that as a reader.

The question is, do I have to format my docx like that or will the converter get rid of the indent and add the blank line when it sees the indent?

Replies:   Keet  Vincent Berg
Keet ๐Ÿšซ
Updated:

@Switch Blayde

But what happens when the docx file indents paragraphs and does not have a blank line between those paragraphs?

You either have a paragraph indent, a blank line, or both. Either can be detected and converted to the SOL format of a blank line between paragraphs and I suspect that is what Lazeez will do.
ETA: It's of course not a 'real' blank line but a margin between paragraphs.

Replies:   Switch Blayde
Switch Blayde ๐Ÿšซ
Updated:

@Keet

Either can be detected and converted to the SOL format of a blank line between paragraphs and I suspect that is what Lazeez will do.

I'm hoping so. It would be the end of having to do things differently.

For example, if I'm working on a novel I use Word's italics. If I'm working on a story for SOL I put the < i > before and after the italicized word(s).

Now that SOL will support docx files, I can use Word's italics all the time (and SOL will convert it when it's an SOL story).

I would like the same for paragraphs. If I'm writing a novel, I indent the paragraphs (all but the first one in a chapter and after a scene break). If I'm writing an SOL story I don't indent and hit Return twice for the blank line. It would be nice if I could indent the paragraphs all the time and let SOL do it's thing to remove the indent and add white space between paragraphs.

Michael Loucks ๐Ÿšซ

@Switch Blayde

For example, if I'm working on a novel I use Word's italics. If I'm working on a story for SOL I put the < i > before and after the italicized word(s).

This is why I separate my layout from my markup, if I can use those terms. I write in a text editor (BBEdit) which is extremely lightweight and doesn't care if I have my entire series open at once. It supports folders and hierarchies, making organization easy. I use only a few of the basic SOL markup indicators (_, *, {tt}). This allows me to do literally anything with it.

When I need to create an eBook, I run a bash/zsh script to turn those markups into markdown, then import into Scrivener which has all the layout and formatting tools, and I've created a custom set for my novels. The process of going from the lightweight text editor to the ePub is literally less than two minutes, plus the time to generate the ePub.

BBEdit is free (if you don't need the advanced features) and has fantastic search and replace features, including 'grep' style search and regular expressions. If you're a coder, it has an amazing set of coder support (folding, color-coding, git, etc).

Of course, I'm on a Mac, so YMMV.

Replies:   Baltimore Rogers
Baltimore Rogers ๐Ÿšซ

@Michael Loucks

I write in XML with CSS classes in Vim. Then I use Groovy scripts to translate it to whatever format is used wherever I'm posting to.

I translate to full SOL markup format when posting here. Lazeez's markup preview tool was invaluable in helping me debug my script.

Replies:   Vincent Berg
Vincent Berg ๐Ÿšซ

@Baltimore Rogers

I translate to full SOL markup format when posting here. Lazeez's markup preview tool was invaluable in helping me debug my script.

The only thing you're missing, using this approach, is your ability to include publishing marks (em-dashes, ellipses, and other language features), which most SOL authors would NEVER need.

I've always been one to push the limits, to see just how far I can stretch them, either with plots, formatting or writing styles. I'm currently uncovering LOTS of unforseen 'bugs' in Lazee's beta code (much to his chagrin).

Vincent Berg ๐Ÿšซ

@Switch Blayde

Now that SOL will support docx files, I can use Word's italics all the time (and SOL will convert it when it's an SOL story).

I would like the same for paragraphs. If I'm writing a novel, I indent the paragraphs (all but the first one in a chapter and after a scene break). If I'm writing an SOL story I don't indent and hit Return twice for the blank line. It would be nice if I could indent the paragraphs all the time and let SOL do it's thing to remove the indent and add white space between paragraphs.

Correct. That's precisely what it does (preserves your in-line formatting).

Replies:   Switch Blayde
Switch Blayde ๐Ÿšซ

@Vincent Berg

Correct. That's precisely what it does (preserves your in-line formatting).

You contradicting yourself (this post and previous post).

First you said it will ignore paragraph commands (such as indent).
In your second post you said it preserves your in-line formatting.

In the future, If I submit a docx file to the SOL Wizard with:

[indent]xxx xxxx.
[indent] x xxx.

Will it show up on SOL as:

xxx xxxx.

x xxx.

or

xxx xxxx.
x xxx.

Replies:   Gauthier  Vincent Berg
Gauthier ๐Ÿšซ

@Switch Blayde

That's easy: it depends on your word paragraphs
if you wrote it:
[indent]xxx xxxx.[Para]
[indent]x xxx.[Para]
you will have a normal paragraph spacing.

if you wrote it:
[indent]xxx xxxx.[line break]
[indent]x xxx.[para]
you will have a normal line spacing

So not what you wished for in any case.

Adding an empty [para] would be removed by most editor as an error.

The only other standard break for word, would be a continuous section break, but I've never seen it used to mark a scene break.

So the word usage is to use styles (creating a style: "new Scene" with para indent or extra spacing) would do the trick, but Lazeez will ignore it as there is no way for him to identify the intent based on the style in a standard way. Word has no standard for scene break. It's the same problem Lazeez has with blockquote, the Microsoft Word style and formatting markup is not semantic.
You would probably be better served with a line of ***

Replies:   Switch Blayde
Switch Blayde ๐Ÿšซ
Updated:

@Gauthier

You would probably be better served with a line of ***

I'm not referring to a scene change.

I'm talking about paragraphs. Today, I write my SOL story in Word. I don't use italics. I put the < i > around the italicized words. I don't indent paragraphs. I put a blank line between them (hit Return twice). Then I save it as txt and submit it to the Wizard.

Of course when I write a novel in Word I use all of Words formatting (italics, paragraph indents without a blank line between them, etc.).

So I have to do things differently depending on whether I'm writing it for the SOL Wizard or, say, input to Calibre. I was hoping I could do it the same for both and have the new SOL converter convert it to the format SOL requires.

It sounds like that won't happen so I'm not sure what the advantage is for having the docx converter.

Replies:   Vincent Berg  REP
Vincent Berg ๐Ÿšซ

@Switch Blayde

Of course when I write a novel in Word I use all of Words formatting (italics, paragraph indents without a blank line between them, etc.).

As long as you're only italicizing indiividual words or lines, that'll continue to work. What won't is if you use a specially defined italicized paragraph style.

"In-line" formatting is formatting surrounded by < i>< /i> commands. "Style-based formatting" is when you predefine a particular paragraph type.

REP ๐Ÿšซ

@Switch Blayde

It sounds like that won't happen

The best way to find out is to post a chapter by submitting a .docx file and see what happens. If you don't like it, you can always resubmit the chapter using your former method.

Vincent Berg ๐Ÿšซ
Updated:

@Switch Blayde

You contradicting yourself (this post and previous post).

First you said it will ignore paragraph commands (such as indent).

In your second post you said it preserves your in-line formatting.

Sorry, "in-line" formatting means formatting "within the line", as opposed to "Style" or paragraph formatting. Since I tend to adapt practices based on publishing industry standards, I often use publishing terms.

But as others have noted, it displays on SOL depends entirely on how you have SOL set up to display pages, as it's user selectable. But the SOL default is #3.

@Gauthier

Adding an empty [para] would be removed by most editor as an error.

The Industry (html) standard for that is to use "< p>& nbsp;< /p>".

Vincent Berg ๐Ÿšซ

@Switch Blayde

The question is, do I have to format my docx like that or will the converter get rid of the indent and add the blank line when it sees the indent?

Having tried it, the .docx (it doesn't accept plain .doc files) simply ignores ANY paragraph commands other than center/left/right alignment.

Vincent Berg ๐Ÿšซ

@REP

It appears to me that Lazeez has been saying, he will convert the content format defined by the poster's .docx files to the format used by SOL. He also said, only certain .docx codes will be converted. I don't recall him saying the converted .docx file content would have the same appearance when it is posted.

I ran a test, submitting an existing chapter using a .docx file, and you're right. As I mentioned previously, it basically acts as an alternative to submitting via html (i.e. it accepts html's inline formatting, but ignores ALL predefined formatting).

Thus, for me at least, there's no reason to NOT continue submitting via html (which still offers more options than submitting via text).

Replies:   REP
REP ๐Ÿšซ

@Vincent Berg

which still offers more options than submitting via text

I have been submitting in HTML format. I posted the HTML files Word created without editing them.

Replies:   Vincent Berg
Vincent Berg ๐Ÿšซ

@REP

I have been submitting in HTML format. I posted the HTML files Word created without editing them.

The comment was about you, merely a reflection on how much you can accomplish via html (beyond basic inline formatting). As for editing html files, I've been doing this for a while, and learned long ago that Word adds a bunch of junk in their html. In order to understand what it was including, and how to circumvent it, I started examining it. What I learned was shocking, and I've never looked back on custom-building my ePubs (it's extremely rare when one my mine fails a basic epub check).

For most, it's an unjustified expense, but because I'm so particular about what I do, the cleaner, smaller and more efficient code is definitely worth it.

Replies:   Switch Blayde
Switch Blayde ๐Ÿšซ

@Vincent Berg

Word adds a bunch of junk in their html

I think we're talking about SOL here, not creating an ebook. It doesn't matter how much junk Word puts in the HTML. The SOL Wizard strips it out.

Replies:   Vincent Berg
Vincent Berg ๐Ÿšซ

@Switch Blayde

I think we're talking about SOL here, not creating an ebook. It doesn't matter how much junk Word puts in the HTML. The SOL Wizard strips it out.

I was explaining WHY I'm so obsessed with 'cleaning up' my html files, because REP had commented on it. More than anything else, my obsession is designed to help my documents survive 'converters' which aren't as well implemented as Lazeez's are. The fact I keep stretching the limits means I'm fairly well-informed on overcoming various problems in publishing and posting online. Nothing more.

Vincent Berg ๐Ÿšซ

@Keet

Maybe Lazeez will make a change for the docx format where he also accepts a paragraph indent as a marker for a new paragraph. That would eliminate the need for an extra blank line and leave the docx the same for a conversion to an ebook.

The new .docx code which Lazeez implemented (at least on the beta site) does exactly that, as I can customize the definition of individual lines (Paragraph Styles). However, no one seems to be clear on what takes precedence in the conversion, the SOL paragraph spacing or the newer .docx conversion. Someone needs to experiment.

Replies:   Switch Blayde
Switch Blayde ๐Ÿšซ

@Vincent Berg

The new .docx code which Lazeez implemented (at least on the beta site)

I thought the new docx was SOL, not bookapy.

Replies:   Vincent Berg
Vincent Berg ๐Ÿšซ

@Switch Blayde

It was supposedly based on the beta site's new .docx code. I was hoping they operated similarly, but apparently it's just a more complicated, more error prone and messy way of posting as html.

Banadin ๐Ÿšซ

Guess I was confused.

Vincent Berg ๐Ÿšซ

This was added as part of a new beta service by Lazeez, and I must say, it's working extremely well! I'm assuming it's slightly modified, as my text on that site ALLOWS indented Style Defined text, so we'll have to see how it's implemented.

But he's done an amazing job, much better than most self-publishing sites do (lulu.com, smashwords.com, Apple.com, etc.). I typically do ALL my formatting by Style Definitions, including graphical text (rendered specialized fonts), and he's correctly captured them ALL!

Good job, as usual, Lazeez.

Lazeez Jiddan (Webmaster)

There have been multiple submissions already on the site that were posted using the docx format. They're like all the other stories on the site, they're formatted using our story stylesheet.

The converter detects paragraphs and handles them automatically. Again, it pull the text and looks for italics, bold, strikethrough, superscript and headings 1 to 4 and applies the appropriate html tags. The final styling is done with the normal stylesheet and the user customizations.

oldegrump ๐Ÿšซ

@Lazeez Jiddan (Webmaster)

What about straight .doc files, I do not have a word processor that will create .docx files. Currently, I submit in HTML to retain formatting but even that occasionally inserts an extra line and removes indents that are part of my personal preferences.

Replies:   REP  Keet
REP ๐Ÿšซ

@oldegrump

I used to go to a store that sold old software at discounted prices. I've bought some of my current software at very reduced prices on-line.

I did a search and found this, which might help you out.

https://www.ebay.com/itm/302235394155

Replies:   Switch Blayde
Switch Blayde ๐Ÿšซ

@REP

I used to go to a store that sold old software at discounted prices.

I once worked for a company that sold servers and other computer stuff. I was given a tour that included an area where they had vending machines that sold software. It was in an alpha state. The company was looking for venture capital to kick off the business. I don't think they got it.

Replies:   Keet
Keet ๐Ÿšซ

@Switch Blayde

I once worked for a company that sold servers and other computer stuff. I was given a tour that included an area where they had vending machines that sold software. It was in an alpha state. The company was looking for venture capital to kick off the business. I don't think they got it.

Those machines are now virtual and called app stores ;)

Keet ๐Ÿšซ
Updated:

@oldegrump

What about straight .doc files, I do not have a word processor that will create .docx files.

You can use LibreOffice to open a .doc file and save it as .docx. It's free and available for all OS platforms.

There are several reasons why .doc will never be supported. For starters, it's ancient and no longer used in business. The other reason is that .doc is a proprietary binary format which means you can only read it with the correct editor, usually MS word. .docx is just a zip file with flat text files which can be opened and read by almost anything and anyone.

oldegrump ๐Ÿšซ

REP

I do not like the newer vers of word. and openoffice does no import files in doc format well. I will continue to work with I have and post as html.

Replies:   Keet
Keet ๐Ÿšซ

@oldegrump

I do not like the newer vers of word. and openoffice does no import files in doc format well. I will continue to work with I have and post as html.

I'm not sure about the status of OpenOffice but LibreOffice does imports very well, including the .doc format. LibreOffice is rapidly becoming a better and more reliable word processor than Word is. Both are missing a few (obscure?) features that the other does have. Big plus for LO is that it is supported on all platforms and there's even a viewer for Android (with editing functions in test).

Back to Top

 

WARNING! ADULT CONTENT...

Storiesonline is for adult entertainment only. By accessing this site you declare that you are of legal age and that you agree with our Terms of Service and Privacy Policy.


Log In