| Commit message (Collapse) | Author | Age | Files | Lines |
| |
|
|
|
|
|
|
|
|
|
|
| |
Completely get rid of StringIO.py and cStringIO.c.
I had to fix a few tests and modules beyond what Christian did, and
invent a few conventions. E.g. in elementtree, I chose to
write/return Unicode strings whe no encoding is given, but bytes when
an explicit encoding is given. Also mimetools was made to always
assume binary files.
|
| |
|
|
|
|
|
|
| |
(a) "except E, V" -> "except E as V"
(b) V is now limited to a simple name (local variable)
(c) V is now deleted at the end of the except block
|
| |
|
| |
|
| |
|
|
|
|
|
| |
whitespace characters to a single space.
Small changes elsewhere, mostly to clean up the code a little.
|
|
|
|
|
|
|
|
|
|
|
| |
For the real document element, make sure the prolog is migrated into
the document element so it isn't left stranded.
Make fixup_trailing_whitespace() whitespace do what was really intended.
Add the *desc environments used in the C API manual to the list of
things that can exist at the paragraph level so they don't get wrapped
in <para>...</para>.
|
| |
|
|
|
|
| |
of something ad-hoc.
|
|
|
|
| |
powerful latex2esis.py.
|
|
|
|
| |
made a part of the paragraph.
|
| |
|
|
|
|
|
|
|
| |
(minidom in particular); it was using PyDOM which is now obsolete.
Only write the output file on success -- this avoids updating the timestamp
on the file on failure, which confuses "make".
|
| |
|
|
|
|
|
|
|
|
| |
join_adjacent_elements(): Hack to merge adjacent instances of
<option>; the source \programopt with GNU-style long options
created problems with LaTeX2HTML; this removes the evil
workaround, which should never be necessary from structured
documents(!).
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
<description> elements. Add another sanity
check to make sure a special case only becomes
involved for <opcodedesc> and <opcodedescni>
elements.
fixup_paras_helper(): Clean up control flow.
Everywhere that checks node.nodeType==ELEMENT and node.tagName now
just check node.nodeName (nodeName is guaranteed for all node types
and will be special values that don't conflict with GIs for
non-ELEMENT nodes). Simplifies a number of tests and improves
readability in a lot of places.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
engine in latex2esis.py can do, so discard it.
paragraph detection & markup: Handle some of the list-like
environments properly, picking up paragraphs inside
list items. (Only changed a few element-sets used for
recognition.)
fixup_refmodindexes_chunk(): Use the right attribute to determine the
module a <ref*modindex/> element is referring to.
convert(): Don't call remap_element_names(); it was removed.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
same as sys.stderr.write(), but bwrite() will make it
bold if stderr is a tty (for visibility).
Use these everywhere instead of accessing
sys.stderr.write() directly.
find_all_child_elements(): Find all elements by type name that are
immediate children of another.
cleanup_synopses(): Receive both the document and fragment objects as
parameters; if only the document is passed, synopsis
information is never found since it's in the fragment!
build_para(): Add a chunk of code to ensure that a newline always
follows the </para> end-tag. Not really important, but
makes presentation consistent for authoring/
maintenance.
fixup_bifuncindexes_chunk(): Use find_all_child_elements() instead of
find_all_elements() so we don't work with too many
levels at once; that could cause DOM exceptions with
the <elem>.removeChild() method.
|
|
|
|
| |
very interesting.
|
|
|
|
| |
in a fragment rather than the main document object.
|
|
|
|
| |
creating the <moduleinfo>.
|
|
|
|
| |
become <interpreter-session> elements.
|
| |
|
|
|
|
|
| |
this takes 5 minutes off the conversion of the whole tree by
reducing the number of tree-traversals from 14 to 1.
|
|
|
|
|
| |
the right <signature>s and <description>. This seems to work a lot
better now!
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Lots of changes to get the paragraph marking to work, and not go into
an infinite recursion.
Start to rationalize markup of method signatures; not complete.
find_all_elements(): similar to getElementsByTagName(), but operates
on the not-quite-legal fragments we deal with here.
Simplifies several individual transforms.
fixup_sectionauthors(): Convert \sectionauthor items to <author> just
after the <section>'s <title>.
|
|
|
|
|
|
| |
<author> inside the <moduleinfo> element.
Fix problems with paragraph identification.
|
|
|
|
|
|
| |
than one "root" that prevented all the <label id=...> items
from being promoted to id attributes on the enclosing chapter/
section/... properly.
|
|
|
|
| |
to <module link>.
|
|
|
|
|
|
|
|
|
|
|
|
| |
currently generated by the LaTeX and LaTeX2HTML processes is generated
here as well, making it more flexible in the SGML version.
Reduce the <args> element so that <optional> goes away; just use
square brackets to indicate what's optional. This makes it easier to
read than the LaTeX, and the processor can do any checking it needs to
in order to make sure it's legit. Possible shortcoming: DSSSL
processors may need more explicit markup. Can probably hack around it
for this case, but we'll see.
|
|
|
|
|
| |
conformant with the OASIS Exchange model. There's a little more
massage left to do, but not much.
|
| |
|
|
|
|
|
|
|
|
|
| |
<moduleinfo> element based on various meta information, and
strip some cruftiness. This is more usable for information
extraction, and organizes the information more clearly.
cleanup_synopses(): Rewrite to use create_module_info(), so this
will work with multi-rooted "documents".
|
|
|
|
| |
little; more thinking is needed about what we really want.
|
|
|
|
|
| |
Generate ESIS data instead of XML.
Misc. changes to some transforms.
|
| |
|
|
LaTeX-based ESIS streams to be a little better structured, and generally
perform clean-up.
Preliminary.
|