In this webcast recording, George Bina shows you how to create DITA content from zero to a full deliverable using oXygen. The full deliverable leads to multiple publishing formats.
Author Archive

Vendor webcast: An overview of authoring DITA with oXygen
May 15th, 2012 by ScriptoriumTech

Webcast: Collaboration: A hands-on demo using Confluence wiki
April 27th, 2012 by ScriptoriumTech
In this video recording, guest presenter Sarah Maddox explains why collaboration is a good thing, why a wiki is a good solution for it, and how to do it on Confluence.

Vendor webcast: Collaborative content development with easyDITA
April 4th, 2012 by ScriptoriumTech
Paul Wlodarczyk shows how cloud-based tools like easyDITA can change the way you approach collaboration, and in turn speed your time to publish and simplify your work process.
(more…)

Vendor Webcast: Bring your DITA content to the next level with IXIASOFT
March 1st, 2012 by ScriptoriumTech
Have you ever wondered how to effectively manage your DITA content as it continues to grow? Jean-François Ameye shows how IXIASOFT’s full featured DITA CMS solution handles your entire technical documentation process from authoring and searching to reviewing and publishing.

Webcast: HTML5 and its impact on technical communication
February 16th, 2012 by ScriptoriumTech
In this webcast recording, guest presenter Peter Lubbers gives a fast-paced overview of HTML5 with a focus on how it affects the tech comm field. He covers what exactly HTML5 is, why you should care, and how you can develop with HTML5. The session covers which browsers support which features, and how you can make the new features work in older browsers so you can start using HTML5 today.

Webcast: Content strategy for software development (with Ray Gallon)
September 29th, 2011 by ScriptoriumTech
Content strategy is usually thought of in the context of web development. But today’s software is increasingly information-rich. Software is a content vector, and we need to manage the life cycle of that content. This webcast from guest speaker Ray Gallon adapts content life cycle management principles, taken from web-oriented content strategy, to software development cycles. Some examples from real experiences illustrate this adaptation. (more…)

Webcast: Crafting Clarity in a Climate of Chaos
April 27th, 2011 by ScriptoriumTech
Scriptorium hosts Tristan Bishop of Symantec as he discusses what technical writers need to do to keep up with transforming communication methods and rapid advances in global, mobile, and social dialog.
(more…)

Webcast: DITA Best Practices
March 11th, 2011 by ScriptoriumTech
In this webcast hosted by Scriptorium, author Tony Self discusses his new book, The DITA Style Guide, and how it fits into a DITA workflow.
(more…)

Webcast: Discussion forum: STC vice president candidates
March 2nd, 2011 by ScriptoriumTech
Alan Houser and Vici Koster-Lenhardt are running for the office of Vice President of the Society for Technical Communication. If you missed the live webcast, watch this recording to get to know the candidates. (more…)

A makeover for the DITA OT’s PDF Plugin
January 10th, 2011 by ScriptoriumTech
by David Kelly
Adoption of DITA involves a fair number of hurdles, not the least of which is getting nicely formatted output. Two kinds of output in particular present problems: tripane web help, because it is not provided in the DITA OT, and PDF, because the plugin is difficult to modify. My esteemed colleague, Simon Bate, has addressed the tripane help issue, so you can look forward to a blog post from him on that subject.

To OT or not to OT?
November 29th, 2010 by ScriptoriumTech
by David Kelly
In previous publications, Scriptorium staffers have made the case that open source software (“free”) is not necessarily cheap. Our experience with customizing the DITA Open Toolkit (OT) emphasizes this point daily. One question that always crops up is, “Is the DITA OT right for me, and how much will it cost to make it right?” (more…)

The promise of XML publishing
September 2nd, 2010 by ScriptoriumTech
by David Kelly
Being a technological optimist, I always watch for ideas that provide more, better, faster, cheaper, healthier, and throw in a heap of wow factor while you’re at it. On the other hand, after observing the progress of technological ideas over a few decades, I am beginning to formulate what I call “The Principle of Minimum Adequacy.”
(more…)

Webcast: Knowledge integration: The future of technical communication
July 16th, 2010 by ScriptoriumTech
Scriptorium hosts Tristan Bishop of Symantec as he muses on technical communicators’ evolving roles.

Customizing PDF output with the DITA Open Toolkit
March 31st, 2010 by ScriptoriumTech
by David Kelly
Read in PDF
(380 KB, 23 pages)
The default DITA Open Toolkit (DITA OT) provides XSL-FO stylesheets that create simple, generic PDF files. The default DITA OT PDF output is a good starting point, but it is an ugly place to stop.
To create better-looking PDF output with more publishing functions, you need to change XSL-FO stylesheets. What kinds of things can you do, and where do you start? This paper explains how to modify XSL-FO stylesheets in the DITA OT.

Customizing PDFs in the DITA OT – Yes!
March 31st, 2010 by ScriptoriumTech
by David Kelly
The value of the DITA Open Toolkit as a one-stop workflow for “Outputs, All Sorts” is diminished if it is too difficult to customize PDFs in the toolkit. For many people, the DITA OT’s PDF plugin, and XSL-FO in general, are very much a “riddle, wrapped in a mystery, inside an enigma.” I would not dispute that XSL-FO and the DITA OT can be difficult to modify, but after having trained myself in XSL-FO, I have found that a few good examples can be extremely helpful.
The Internet contains a wealth of specific solutions for different XSL-FO problems, and there are books available on XSL-FO and on DITA. But wouldn’t it be nice if there were something that focused on the application of XSL-FO to the DITA Open Toolkit, even for something as basic as changing page margin sizes or modifying the look of the cover page and footers?
In an attempt to fill the void, I have written a white paper on the topic:
Customizing PDF output in the DITA Open Toolkit (PDF file, 380 KB)
I hope the paper makes the XSL-FO in the DITA OT more accessible to people who want to customize it.
As a word of caution: every introductory statement I have read about XSL-FO includes a statement to the effect that “XSL-FO is extremely verbose.” The same applies to explanations about how the XSL-FO templates in DITA work. I won’t claim that the content of this white paper is “simple.” (I don’t dare!)
I hope to start hearing soon about your spiffy new customizations.

Webcast: DITA-FMx demonstration
March 16th, 2010 by ScriptoriumTech
This webcast demonstrates using the DITA-FMx plugin with FrameMaker 9 to author, edit, and create output from DITA content. Topics covered during the demo include creating DITA topics using different options and templates and generating a book from the map and then saving to a PDF file.
(more…)

You love PDF. You hate making it.
January 21st, 2010 by ScriptoriumTech
by David Kelly
The DITA Open Toolkit uses FOP to create PDFs directly from XSL-FO. There are many PDF features that XSL-FO and FOP cannot provide during this conversion—configuring PDF security features, for instance, and using EPS graphics. While attempting to provide better graphics for a customer, we realized that FOP can also transform XSL-FO to Postscript, and from there a second step could use Adobe Acrobat Distiller to create PDFs. Initial results are in, and like the Ronco Cap Snaffler, “It really, really works!” (In fact, it probably works better than the redoubtable Cap Snaffler itself.)
Using an FO-to-Postscript-to-PDF chain makes a lot of sense for creating PDFs. The ability to incorporate EPS files into PDFs makes it worthwhile by itself. But it also opens possibilities for fine-tuning PDFs. The Postscript-to-PDF phase invites the addition of a reference to a Distiller job option file. Using this arrangement, you could create different kinds of PDFs in a single workflow. Variations could include different page sizes, different optimizations (screen and print), and different security settings. I’m getting all Shamwow just thinking about it.
Adding Acrobat Distiller to the workflow also suggests the possibility of adding post-Distiller tools into the mix. It would be interesting to explore the ability to enable commenting by using a command-line based tool such as CAM::PDF or perform additional PDF configuration tweaking with a programming resource like PDF::API2 . It would also be interesting to see whether any of the MicroType TimeSaver/Assistants could be grafted into the workflow. I’m just blue-skying here.
The reference to the Ronco Cap Snaffler made me think of other inventions that could slice, dice, and automate PDF configurations. What could be more appropriate than the slapchop? How can you not think of the DITA Open Toolkit when you hear “You love salad.You hate making it.”?
I’d be interesting in hearing other possibilities you might think of for pre- and post-Distiller PDF functions that could be spliced into the Open Toolkit.

Structured authoring and XML
December 31st, 2009 by ScriptoriumTech
Structured authoring and XML represent a significant paradigm shift in content creation. Implementing structured authoring with XML allows organizations to enforce content organization requirements. The addition of hierarchy and metadata to content improves reuse and content management. These benefits, however, must be weighed against the effort required to implement a structured authoring approach. The business case is compelling for larger writing organizations; they will be the first to adopt structured authoring. Over time, improvements in available tools will reduce the cost of implementing structured authoring and make it affordable for smaller organizations.

Handling XSL:FO’s memory issue with large page counts
December 31st, 2009 by ScriptoriumTech
Formatting Object (FO) processors (FOP, in particular) often fail with memory errors when processing very large documents for PDF output. Typically in XSL:FO, the body of a document is contained in a single fo:page-sequence element. When FO documents are converted to PDF output, the FO processor holds an entire fo:page-sequence in memory to perform pagination adjustments over the span of the sequence. Very large page counts can result in memory overflows or Java heap space errors. (more…)

Friend or foe? Web 2.0 in technical communication
December 31st, 2009 by ScriptoriumTech
The rise of Web 2.0 technology provides a platform for user-generated content. Publishing is no longer restricted to a few technical writers—any user can now contribute information. But the information coming from users tends to be highly specific, whereas technical documentation is comprehensive but less specific. The two types of information can coexist and improve the overall user experience. (more…)

Managing implementation of structured authoring
December 31st, 2009 by ScriptoriumTech
Moving a desktop publishing–based workgroup into structured authoring requires authors to master new concepts, such as hierarchical content organization, information chunking with elements, and metadata labeling with attributes. In addition to these technical challenges, the implementation itself presents significant difficulties. This paper describes Scriptorium Publishing’s methodology for implementing structured authoring environments. This document is intended primarily as a roadmap for our clients, but it could be used as a starting point for any implementation.

Assessing DITA as a foundation for XML implementation
December 31st, 2009 by ScriptoriumTech
The Darwin Information Typing Architecture (DITA) is being positioned as the solution for XML-based technical content. Is DITA right for you?
This white paper describes the potential business advantages of DITA, provides a high-level overview of DITA’s most important features, and then discusses how you can decide whether to develop a DITA-based XML implementation.

Configuring fonts in FOP and the DITA Open Toolkit
December 31st, 2009 by ScriptoriumTech
by David Kelly
The DITA Open Toolkit includes an installation of FOP, a tool used to convert DITA content to PDF. A user must configure both FOP and the Open Toolkit to use any other fonts beyond a few basic defaults. This paper supplements the Apache instructions for configuring FOP and provides additional information for integration of FOP with the Open Toolkit.

Webcast: Documentation as conversation
December 31st, 2009 by ScriptoriumTech
Even if your documentation system does not converse with your users, your documentation can help customers talk to each other and make the connections that help them do their jobs well or learn something new as if they were in a classroom with a community for classmates. This talk describes how you can think about documentation and user assistance in a conversational way, with the help of social media technology. I’ll discuss the topics in my new book, Conversation and Community: The Social Web for Documentation. I’ll describe the use of in-person Book Sprints that combine wikis and community events to gather together writers to accomplish documentation goals.
(more…)

Webcast: Beyond documentation
December 31st, 2009 by ScriptoriumTech
In these challenging economic times, what does the future hold out for technical communicators? What can we do to make sure we forge out a career?
Non-technical individuals will always need something to explain technical things, but as they change the ways in which they get their information, will technical communicators be stuck in the passing lane?
This session looks at the future of technical writing and likely changes to the ways in which user assistance is delivered. Are we moving beyond documents? If so, to what? Importantly, what does this mean for technical communicators?
About Ellis Pratt
Ellis has over ten years experience working on documentation projects and is an accomplished speaker on topics such as the future trends for user assistance, online Help and online communities. He is the author of “Tech Writing 2.0 – The application of Web 2.0 technologies to technical documentation”, “So you want to become a technical author” and “Network to Get Work”.
His philosophy is winning by sharing, connecting people to what they seek, networked businesses and people. The aim is to create a network of business professionals who support one another, learning, networking and trading. He is also an Associate of the Institute of Engineering and Technology.
Ellis is the co-owner of Cherryleaf Ltd.

Webcast: Cost-effective document design for a translation workflow
December 31st, 2009 by ScriptoriumTech
In this webcast, Nick Rosenthal discusses the challenges companies face when translating their content and offers some “best practices” to managing your localization budget effectively, including XML-based workflows and ways to integrate localized screen shots into translated user guides or help systems.
About Nick Rosenthal
Nick is the Managing Director of Salford Translations Ltd and has 20 years of experience in translator training and professional development. He sits on the Translation Subcommittee of the OASIS Committee, is a former member of the ITI Council, and past president of the UK chapter of the Society for Technical Communication.

Creating PDF files from DITA content
December 31st, 2009 by ScriptoriumTech
by David Kelly
When technical documentation groups adopt structured authoring, they often choose the Darwin Information Typing Architecture (DITA) open-source standard because it requires no up-front investment. The DITA Open Toolkit (DITA OT) provides a way to produce multiple outputs, including Portable Document Format (PDF) files; however, the technology for creating PDF files is limited, and modifying the formatting is challenging.
Software companies have stepped in to offer methods for customizing and producing PDF files, but there are different levels of DITA support and various degrees of integration with the DITA OT.
How do you decide which method is best for you? This paper explains the alternatives and trade-offs for each method and helps demystify the decision process.
Download the PDF
(950 K)

FrameMaker 8 and DITA
December 31st, 2009 by ScriptoriumTech
FrameMaker 8 provides support for creating, editing, and publishing DITA 1.0 content. You can use FrameMaker to author and publish DITA content as high-quality print and PDF output, or you can create content elsewhere and use FrameMaker strictly for publishing print and PDF output. This document describes FrameMaker’s DITA support and provides guidance on how you might best integrate DITA and FrameMaker.
Download the PDF file
(5 MB, 55 pages).

The State of Structure
December 31st, 2009 by ScriptoriumTech
In early 2009, Scriptorium Publishing conducted a survey to measure how and why technical communicators are adopting structured authoring.
Of the 616 responses:
- 29 percent of respondents indicated that they had already implemented structured authoring.
- 16 percent indicated that they do not plan to implement structured authoring.
- 14 percent were in the process of implementing structured authoring.
- 20 percent were planning to do so.
- 21 percent were considering it.
This report summarizes our findings on topics including the reasons for implementing structure, the adoption rate for DITA and other standards, and the selection of authoring tools.
Download
(2 MB, 56 pages)
Discuss this document in our forum

New White Paper: Configuring fonts in FOP and DITA OT
October 19th, 2009 by ScriptoriumTech
by David Kelly
We recently updated a new instance of DITA OT 1.4.3 to create PDFs of Scriptorium proposals from DITA source files. We wanted to emulate the original proposal template, which included a lot of fonts that were not configured in the DITA OT’s PDF rendering engine, FOP.
Remembering the nuances of font configuration for FOP turned into a lengthy exercise. After seeing requests for help on configuring fonts in the DITA user’s group forum and FOP’s Nabble forum, we decided to publish a white paper addressing the subject. It covers FOP configuration, DITA OT configuration to use FOP-configured fonts, and how to reference configured fonts in the DITA OT’s XSL-FO style sheets.
The white paper is now available here:
Hopefully it will help prevent ugly little font scenarios like this one:

HTML 5: Browser Wars Reprise?
September 21st, 2009 by ScriptoriumTech
Recently, I ran across an article by Rob Cherny in Dr. Dobb’s Journal. He suggests that the added features in HTML 5 combined with an end to the development of XHTML point to a brighter standards-based future. He sees closed solutions like Flash, Silverlight, and JavaFX being supplanted directly by HTML 5 code. His view is that the web owes its success to standards.
It’s tempting to agree. Standards certainly allow for collaborative growth. Though I’m not the least bit convinced that collaborative growth is the foundation of the web’s success. I believe that the web’s incredible success is really traceable to the simplicity and flexibility of HTML. Each new version takes us further from that simplicity.
Through the browser war years we saw the impact of new features in HTML—incompatibility among browsers. My sense is that the success of Flash is largely due to the fact that Adobe owns both ends of the problem. They create the tools that generate Flash code as well as the viewer. Web developers can pretty much assume that what they see, when they build a Flash-based solution, is what the end user will see.
I fear that we will head right back to the bad old days if HTML 5’s complex capabilities are widely employed. I suspect that ‘wait and see’ will last a pretty long time. I have other concerns about HTML 5—more on that later. What do you think—will your organization take advantage of these new capabilities as soon as they are available?

Conversion Is Dirty – 2300 Years Later (Part γ of γ)
September 9th, 2009 by ScriptoriumTech
by David Kelly
In previous installments I gave a worm’s-eye look at the details of our work with papyrological texts, and a satellite-level thumbnail sketch of the history of the documents and their contents. Now it’s time for a street-level view of how our work fits into the present.
From the point of view of the texts, the definition of “audience” has made an extreme shift. Tax records, contracts, letters, and so forth, long ago lost their primary audiences, but now scholars are interested in the documents. For some of those scholars, the degraded texts represent a challenge for reconstructing the original content. These are the papyrologists, the people we have been engaged to help. Once the texts have been reconstructed as much as possible, other students of Classical times use them for a variety of purposes.
The condition of the papyrus is only one of the problems faced by papyrologists. Questions of provenance, poor handwriting, faded ink, and access to remote storage locations are other issues. And then there are the publication methods used to engage the academic community in discussion and consensus about a given scholar’s results.
We learned, for instance, that since the late 1800s when the transcription of papyri started being addressed by the academic community, only 60,000 documents have been transcribed. It is a slow, painstaking process that requires special training at graduate levels in Classical Literature. Funding is spotty, but the need is still significant. At the University of Berlin alone there are 50,000 papyri waiting to be addressed.
The publication cycle for transcriptions and commentary on papyri is slow. In the Papyrology Room at Duke, the shelves are filled with academic journals, some of which are published on an annual basis, many less frequently. Publication of a document consists of the scholar’s transcription from the original papyrus (or a high quality reproduction) to the typographical format using the Leiden conventions mentioned earlier. It is a highly specialized publication process. Once a document is published in a journal (after, one imagines, years of waiting), other scholars examine the document and, if possible, compare it to the original or to a facsimile. (Facsimiles are seldom available in the journal containing the transcription, presumably because photographic reproductions are expensive.) If there are corrections or commentary to add to the transcriptions, it may be years before they are published in another journal.
Surely there must be a better way to make this information available to the scholars and interested parties of the world.
The IDP initiative was conceived to address many of these problems. The target of this initiative is to have a web location where transcriptions, translations, and digitized images of the original papyri are available and indexed against each other. Existing publications are in the process of being converted and cleaned up. (This is where Simon and I came in.) In addition to the primary sources, the website will include facilities for commenting on the texts, adding new documents, and having the ability for a community of scholars to participate in reviewing and publishing new or changed materials. Publication cycles may be reduced from years to days. And instead of having to travel to select universities with papyrology programs and cloistered libraries, anyone with Internet access would be able to read and view the most recent advances in papyrological scholarship.
You can get a preview and a feel for the information system by looking at a predecessor system, APIS (http://www.columbia.edu/cu/lweb/projects/digital/apis/index.html)(A new prototype interface is available at http://papyri.info).
Toward the end of the project, Simon and I discussed the feelings we had about participating in the present phase of an information chain that went all the way back to ancient Greek civilization. For one thing, there was an enormous feeling of responsibility to get it right. There was also a sense of honor and humility in being included in the history of these texts. These documents came long before us, and now they will bear some (hopefully transparent) mark of our impact on them. For a long, long time.
We have learned a dramatic lesson that the records we generate or pass along may have a life that persists long after we are gone. As we manipulate information with various tagging schemes, as we choose storage media and identify audiences for our delivery methods, it may be helpful to reflect on time scales that go beyond our immediate deliverables.
Someone a few months – or a few millenia – down the road may thank you.

Conversion is Dirty – 2300 Years Later (Part β of γ)
September 3rd, 2009 by ScriptoriumTech
by David Kelly
Aside from the pulp-story like beginning, the story of our project with the Integrating Digital Papyrology (IDP) initiative provides a fascinating case study in what happens to text over time. The time scale is exaggerated, but it provides a not-unfamiliar scenario for the kinds of things that can happen to text. Writers beware! The story includes composition errors, archiving inadequacies, data degradation, restoration procedures, translation, analysis, repurposing due to audience shifts, structuring, markup conversion, acceleration of publication methods – the works.
For myself, the excitement of this story focused on two moments in the Papyrology Room. The first came when one of the participants pulled a volume from the shelves and opened it. The book comprised a photographic reproduction of a scroll containing the original Constitution of Athens (http://en.wikipedia.org/wiki/Constitution_of_the_Athenians). This is an artifact discovered in 1879 that may have been written by Aristotle or one of his students. I learned that of the 169 constitutions written by Aristotle and his students, this is the only one to survive. It survived because the back of it was used for tax records, which apparently were considered much more important than landmark documents in the history of human civilization. (History lesson: File your taxes on the backs of your most important documents.)
The second moment came when we were able to get up close with a papyrus that was being studied by a graduate student. Sandwiched between two sheets of glass, it was a grid of pounded papyrus reed fiber with significant deterioration, particularly in the areas where two soft sections of the reed matter intersected. Across this surface, approximately 1800 years old, you could see the neatly scribed, pale ink handwriting in ancient Greek.
NOTE: The following is a palimpsest on papyrus dating from between 350 and 351 A.D. This image is available from a web page titled “Declaration on oath (P.Duk.inv. 11 R),” http://scriptorium.lib.duke.edu/papyrus/records/11r.html. This image is reproduced by kind permission of the Rare Book, Manuscript, and Special Collections Library of Perkins Library at Duke University. All rights regarding the use of this image are governed by the copyright statements at this location: http://library.duke.edu/specialcollections/services/copyright.html .

I had a strange feeling that this was some kind of time machine. I could sense the brush of an ancient hand across the page, dabbing the surface with an inked, chisel-edged reed. I felt a mind present in the paper with something that needed to be said, not just 1800 years ago, but for a long, long time. Our host explained that this was a marriage contract that gave the groom the right to build on part of a monastery owned by the bride. It was an utterly singular object. More importantly, it seemed to capture something of the human spirit, holding it in a way that could speak across a huge arc of time.
This is not an experience I get very often at work.
Many steps in the story have to be reconstructed at this point. Documents were stored in jars and shelves in houses, temples, and government storage buildings. Libraries were sacked or burned for cultural reasons, reducing the number of copies of documents. Languages, cultures, and writing materials changed drastically over the centuries. Whole cities were lost and forgotten. Moisture, mold, fire and vermin had their way with the papyrus fibers. It is a wonder anything survived.
Then came late Medieval and early Renaissance times. Scholars like Marsilio Ficino and Pico della Mirandola began translating and promoting works from ancient languages, making them available and relevant to the cultural changes leading to the Renaissance and the renewal of Western Civilization. Subsequent to the translations came corrections, commentary, and searches for definitive sources. The Renaissance accelerated scientific study, resulting in a revised sense of time based on geological observations. The change in the sense of time revived interest in ancient stories, such as Homer’s epics, which in turn led to archeological expeditions looking for evidence of the historical truths behind literature and other writings. Old cities were uncovered, and there, in the jars, shelves, and boxes where they had been waiting for centuries, were the fragile records of long forgotten humans.
Next installment: The story continues into present.

Conversion is Dirty – 2300 Years Later (Part α of γ)
September 1st, 2009 by ScriptoriumTech
by David Kelly
We think of structured text and XML as being for content that changes rapidly, because that’s what those tools are good for. So why did we get involved in a conversion project where the content is from millenia-old papyri written in ancient Greek?
On a gray spring day, Simon Bate and I found ourselves in the small Papyrology Room in a sub-basement of Perkins Library at Duke University. I scanned the spines of old papyrological journals while we viewed arcane diagrams on a large flat-panel screen and listened to the meeting’s organizer. The screen flickered and went solid black. Cell phones came out, people went running out of the room…
We were a motley group, assembled for obscure purposes, hungry for information about the dirty task at hand. Leaning intently over the table were the Scottish professor from Heidelberg, a specialist from London College, and our hosts, scholars of Classical Studies from Duke University and New York University. The rest came from around the States with a variety of software backgrounds. An unlikely group, indeed. What ancient secrets were we poised to unleash?
The story deepened as we listened. In 1931, Leiden, the Netherlands, a select group of classical scholars met to establish conventions for indicating the conditions of texts when transcribing the text from papyri to type (http://en.wikipedia.org/wiki/Leiden_Conventions). Later, Dr. David Woodley Packard (http://en.wikipedia.org/wiki/David_Woodley_Packard) would devise a form of SGML for transcribing the Leiden conventions into digital media, even building his own computers to store them (yes, he was the son of THAT David Packard). The SGML was given to Columbia University, and from there it went through an obscure trail, including merges and matches with other forms of digital markup from other universities.
Now our hosts were talking about a more generalized form of markup, the EpiDoc (http://idp.atlantides.org/trac/idp/wiki/EpiDoc) standard of the Text Encoding Initiative (TEI) (http://www.tei-c.org/index.xml). The activities are organized under the rubric of Integrating Digital Papyrology (IDP) (http://idp.atlantides.org/), which is described as “a collaboration between the Duke Data Bank of Documentary Papyri (DDBDP), the Heidelberger Gesamtverzeichnis der griechischen Papyrusurkunden Ägyptens (HGV), the Advanced Papyrological Information System (APIS), and several leading research institutions.” Various parts of the project are funded by the Andrew W. Mellon Foundation and the U.S. National Endowment for the Humanities.
Our heads were swimming.
NOTE: The following image is of a letter on papyrus from Oxyrhynchus, written in Greek dating from the second century AD. This image is in the public domain and was reproduced from this location on Wikipedia: http://hsb.wikipedia.org/wiki/Dataja:Ac_papyrus.png. Please see this website for additional information about this papyrus.

Despite all the signs of an archeological pulp-story adventure, we were there for a typical business reason. Huge amounts of tagged text needed cleanup after having been converted to EpiDoc.
One of the big problems was with Greek numbers. The ancient Greeks had a curious numeric notation system before those neat Arabic numbers came along. The complexity of numbers, combined with the complexity of the Leiden textual markup conventions, had given fits to an earlier conversion of the markup. It was time to call in the Pros from Dover. (Insert film clip of David and Simon as Hawkeye and Trapper John entering the operating room in a military hospital in Japan, golf bags in tow, umbrella at the ready.)
In ancient Greek, numbers are represented by the letters of the alphabet – with a twist. The character used depends on what place the number is in. For example, the numeral one in the one’s place is a lowercase alpha (α). In the tens place it is a lowercase iota, which looks like a one (ι). In the hundreds place it becomes a lowercase rho (ρ). And in the thousands place, it becomes an uppercase Alpha (А). So the number 1111 is Αρια. Pity the poor schoolchildren.
It gets worse: the character for the one’s-place number 6 doesn’t occur in the standard alphabet, but is called Stigma, and you have to install special fonts to represent it. A 90 is a Qoppa, which is in the extended UTF-8 character set – and which has four characters associated with it, depending on whether it is being used as a number or a letter. A 900 is a Sampi, sort of a left-leaning crescent moon with two strokes in the middle of the inside curve.
One kind of problem comes when Leiden convention characters are introduced into a number. For example, the original typographical markup in Leiden conventions might have been:
Αω[μζ]
In Arabic numbers this is 1847, with the “[47]” part being supplied by the editor. (The reason for being able to supply the “47” might be internal clues within the document or references to other documents.)
Now the SGML conversion takes place, resulting in the following:
<num>18</num>[<num>47</num>]
The square brackets still appear, and because there is no space between the <num> elements and the open square bracket, the syntax still represents the fact that this is a single string of numerals, i.e., 1847. But the place values associated with the Greek characters have been “lost in translation.” Look what happens when this number gets converted to EpiDoc in the first round:
<num value=”18″>ιη</num><supplied reason=”lost” cert=”high”><num value=”47″>μζ</num></supplied>
The EpiDoc conversion is based on the value of each <num> element, so the 18 gets incorrectly converted to an iota-nu, and the 47 gets correctly converted to a mu-zeta. The correct value for the “18″ is actually 1800, so the Greek should have been, as in the original, Alpha-omega, or Αω. For scholars, we were told, the iota-nu is egregious nonsense.
Our transform had to find all instances where a <num> element was followed by (or contained) one or more <num> elements and did not have an intervening space. Having found these instances, the transform then reconstructed the initial number value and supplied the correct Greek characters with the correct tagging sequence. The original markup in Leiden conventions was 18[47]. So the correct solution for EpiDoc would be as follows:
<num value=”1847″>Αω<supplied reason=”lost”>μζ</supplied></num>
In fact, I copied this string from a report that was prepared to show the proposed transforms before we applied them to the text. Once the report was approved, the transforms were applied to (deep breath here) 60,000 XML files representing the combined papyrological scholarship of the entire planet for the last 78 years.
We HAD to get it right – or the consequences could be millenial.
Stay tuned for Parts β and γ of this serial blog – coming soon to a Palimpsest near you.

On the shoulders of open source
August 24th, 2009 by ScriptoriumTech
by David Kelly
Every time I mention that we are exploring a new open source application to help support our customers, our Business Development VP, Matt Arnold, asks the very reasonable question, “How do these guys make a living?” I usually mumble something about “support services, extensions, customizations.” This is probably just to make me feel like they are making a living. I do this with the knowledge that a goodly part of my own living is dependent on many people whose hard work I don’t feel like I do enough to acknowledge or support.
Recently my wife and I entertained a friend and his new lady-friend. She mentioned that her 27-year-old son wrote software for an open source rendering engine for game developers. Without thinking nearly hard enough about the social graces, I popped out with Matt’s standard question: “How does he make a living?” To which she replied, fortunately with a laugh, “He doesn’t.”
I suddenly had visions of a pale young man living in a basement, his resume a mile long with hard, complex work, and no one to help him pay his bills but a sympathetic mother. I was imagining, of course, but you get the point.
The difficulties I have in supporting open source applications are complicated. For one thing, I and my coworkers are busy people who focus on supporting our own customers and their demanding schedules. To be blunt, there is seldom time.
Sometimes we make improvements to open source tools, but these are in response to customer needs. Our customers own the fruits of our labors. We might conceivably build clauses into our contractual agreements that allow us to migrate improvements back into the open source community – but having spent a brief period as an expediter of commercial contracts in an engineering corporation, I can assure you that it isn’t going to happen. Legal departments around the globe are twitching and flinching in their beds, wide-eyed and sweating obscure Latin legal terms at the thought. (This is not to say that it doesn’t happen occasionally – but I would say it is not common.)
We do provide a certain amount of help to the open source community through implicit testing, bug discovery, and clarification in the form of forum questions. But I can recall more than one problem we have had with an open source tool that could have been addressed with a small donation earmarked for a certain fix. This would have benefited the entire community as well as our customers, and it probably would have been done more efficiently. As it happened, because we had a specific commercial contract, we developed the solutions ourselves. And it is difficult for us to justify spending our own money on a software issue that is really our customer’s.
General donations are probably needed and deserved – but then one starts to wonder about the reasons open source applications are open source. Isn’t it good for things to be free, because that means people are doing things for all the right reasons, so we get good software made by people who want to make something good? And isn’t part of the theory that in an open source community, multitudes of people contribute small, affordable chunks of time so funding isn’t needed? (And this happens how often? Most of the open source applications I know seem to depend on one or a few dedicated people who are making a living by – other means. I don’t ask.)
We can also ask the question, where exactly do donations go? Would I be able to see itemized account books for an open source application if I asked for it?
One situation we have found is that when we do work for the academic community, they are eager and happy for our work on open source tools to be fed back to the source. This makes sense for them, of course, because they are in the business of fostering improvements in the intellectual community. And if I have learned anything about the academic world, it is that they appreciate free tools.
I don’t know if there are any firm conclusions to be drawn out of all this, and I’m sure the topic could be explored in much more detail. It does seem to me that commercial businesses need to explore some kind of model for supporting open source tools, similar to the academic community. As a consulting company between large customers and small development resources, it is sometimes an awkward balancing act. It is a complicated problem, but it is also clear that many people benefit from the hard work of a community who are chronically underappreciated and often, as I have discovered, personally underfunded.
If you have thoughts or examples about business support for the open source community, I would like to hear them.

Automated trademarking in structured documents – DITA in particular
June 22nd, 2009 by ScriptoriumTech
by David Kelly
Unabashed plug warning: The following entry gives a conceptual overview of a solution Scriptorium has implemented for managing trademarks in structured tagging. And we’re proud of it.
You know the problem. According to your style standards, only the first instance of a given trademarked term should display the trademark symbol. Structured documentation allows you to re-use document parts (such as DITA topics) in just about any order you like. In Manual A, the first file containing the trademarked text is, say, Topic A; in Manual B the first file containing the trademarked text is Topic E, which is also used in Manual A. Where do you put your trademark markup, and how do you maintain it when running Manual A and Manual B at approximately the same time?
Maintaining the trademarks by hand adds a level of effort that becomes non-negligible when you start considering a large number of manuals. And the process becomes error prone – those darned human beings. Different writers might tag things different ways, trademarks might escape notice, or markup might be inserted in inappropriate places by accident.
Isn’t this one of those problems that automated documentation was supposed to solve, not create? I once had a professor who said that computers were supposed to handle the work that computers could solve so people could work on the problems that only people can solve.
More than one of Scriptorium’s customers has presented us with this problem, so we know it is not uncommon. We have found a way to deal with the problem in DITA, and we believe that the principle is sufficiently generic to use in non-DITA structures as well.
To begin with, forget conditional processing. It won’t help you with the problem of marking only the first instance of a term. In the example of Manual A, above, setting the condition “Manual A” would still display the trademark in Topic A and Topic E. This is not what your editor wants – and he or she will let you know it in spades if he or she is any kind of editor at all.
Scriptorium’s solution for DITA, in simple outline, is as follows:
-
Using XSL, go through the ditamaps and remove all trademarking from the document files.
-
Following a predefined list of trademarked and registered trademarked terms, go through the ditamaps and identify the files that contain each term. Create a temporary file that lists the relevant files in order of book occurrence. (This step prevents having to crawl through the ditamaps more than once.)
-
Using Perl, iterate through the files listed for each term in the temporary file. Check the occurrence of each instance of the term, in text order, and evaluate whether it is a valid occurrence that requires trademarking. If so, wrap the appropriate trademark markup around it and go to the next trademark. If not, keep going through the text and the list of files until you find a valid occurrence of this trademark.
We possibly could have used XSL instead of Perl for the third step, but Perl’s text manipulation capability is much more robust than XSL’s, so we chose Perl.
In the implementation, the trademarking utility is coordinated by an Ant process. A user runs this utility just before the book is rendered for output. Being in Ant, the trademarking process could probably be integrated into the DITA Open Toolkit build system fairly easily to create a seamless, one-step production process.
There are a number of interesting problems that arise during implementation. For example, in step 3 the process has to evaluate whether the instance of a term is valid for trademarking. Some kinds of non-valid instances of a term in the text might be:
-
The term is in an indexterm tag.
-
The term is in an href attribute.
-
The term is in a title.
-
The term is in a codeblock tag.
You might also encounter a condition where a trademarked term could be both mixed case and all uppercase. Per your style guide, only the first instance of either should be marked, but not the first instance of both. That sort of requirement makes life just a little more interesting for a coder.
In general, the issue of trademarking first instances is not a simple problem to solve, and variations in style requirements will undoubtedly add complexity and challenges to the problem. But that’s what automated documentation is supposed to be good at, right? So we humans can get back to doing the more difficult problems that only people can solve.
I’m not sure – is that really such a good deal?

A different take on Twittering and technical writers
June 2nd, 2009 by ScriptoriumTech
by Sheila Loring
Technical writers abound on Twitter as do blog posts on how Twitter can make you a better tech writer.
I’d Rather Be Writing has an alternate take in the article Following the NBA Can Make You a Better Writer. Tom Johnson uses the analogy of Kobe Bryant and Lebron James playing their respective positions on the court. He argues that unless you’re a one-person shop, you’re doing yourself a disservice by trying to be a Jack- or Jill-of-all-trades. Play up your strengths, and minimize your weaknesses, tech writers. Read Tom’s article for more.

How Twitter makes you a better writer
April 1st, 2009 by ScriptoriumTech
by Sheila Loring

To Twitter or not to Twitter? That’s the question many technical writers I know face these days. Critics say writing in 140 characters or less is ruining our ability to communicate effectively and follow grammatical rules.
In the following article, Jennifer Blanchard argues that Twittering actually improves your skills as a writer:

On-the-Job Learning
March 25th, 2009 by ScriptoriumTech
by David Kelly
Having changed jobs not long ago, I find that my strategies for picking up work-related knowledge are changing. Perhaps it is my sense of survival slowly kicking in. (I really don’t want to win a Darwin award if I can help it.) Or perhaps I am waking up to the fact that there is more to the job than the problems currently at hand.
In my previous job the deadline pressures meant that I became adept at learning just enough to solve the problems before me and not much more. I became problem focused, and my learning mode was to glean information quickly from a variety of resources to find a solution, implement it, and then sprint to the next problem. From a just-in-time production perspective this was a satisfactory strategy. However, I don’t believe it left me well-prepared to identify problems, anticipate problems, and propose proactive solutions.
This is not to say that deadline pressures aren’t still an issue and that specific problems don’t need to be solved on a daily basis. But I do now find myself becoming aware that part of my job, not only as a Scriptorium employee but as a long-time toiler in the fields of technical communications, is to provide a broader perspective on problems, to anticipate issues, and to provide alternatives that include the benefits of the wide range of tools and knowledge that I have supposedly learned in the meantime.
So instead of skimming and gleaning to find specific solutions that match my current problems, I find that I need to open myself up to new directions, new technologies, new relationships, and new patterns of thinking. And this need brings me to some questions – questions for which I don’t have any particular answers, and for which I hope the readers of Palimpsest will be able to contribute their own perspectives.
What are the best methods for learning the general trends of the technical communications industry, quick overviews of specific types of solutions, what works and what doesn’t? How much time do you think is reasonable to spend at work doing this kind of open-ended research, and how much do you spend of your own time? Do you have favorite websites (after Palimpsest, of course)? Preferred modes of learning (podcasts vs. RSS feeds vs. blogs vs. forums vs. ???).
Here are some of the resources I currently use:
Nabble – For all things FOP (theopen source XSL:FO processor): http://www.nabble.com/FOP—Users-f353.html
EServer TC Library – Technical Communications articles: http://tc.eserver.org/
Mulberry Lists (specifically for XSLT): http://www.mulberrytech.com/xsl/xsl-list/
And, of course, Google.

Adobe Photoshop Express
March 20th, 2009 by ScriptoriumTech
by Sheila Loring
Photoshop Express is a photo editing and sharing web-based application. You can store up to 2GB of photos for free. Accounts are available for $19.99 (20GB), $39.99 (40GB) or $99.99 (100GB) per year.
You can either upload photos or log in to your Facebook, Flickr, Photobucket, or Picasa accounts and edit the photos directly. When I logged in to Flickr and Facebook, my albums all appeared in Photoshop Express along with the descriptions. The tags didn’t make it for some reason though you can tag the photos in Express, but still, I like the Web 2.0 integration.
The typical editing options are available — sharpen, crop, rotate, red-eye, white balance, hue, etc. It’s very similar to Picasa. You can email and link to photos, order prints through Shutterfly, and decorate with cute objects, costumes, conversation bubbles, and other stock graphics.
Once you share photos, friends and family can visit your account online.
Photoshop Express is worth checking out if you don’t want to pay for Photoshop Elements (my personal favorite) or Photoshop and want to edit photos on social network web sites. The editing features are more robust than Flickr (but similar to Picasa). The main downsides are lack of online community (like that of Flickr) and expense. (I get unlimited online storage at Flickr for $24.95/year.)
Update March 21, 2009
I love the integration of Photoshop.com and Photoshop Elements. I can create an album in Photoshop Elements, upload it to my Photoshop.com account, and easily order prints. Cool! Other programs offer this feature, but I’ve never used it in Elements. I’ve only used Elements to organize and edit photos.

Help understanding the new FrameMaker 9 interface
February 19th, 2009 by ScriptoriumTech
by Sheila Loring
FrameMaker 9 has been redesigned to look like Adobe’s other products — InDesign, Photoshop, etc. The new interface can be downright confounding and disappointing for long-time FrameMaker users. You’ll want a good introduction before upgrading.
RJ Vasquez, Senior Product Evangelist at Adobe, will offer a 90-minute e-learning session on FrameMaker’s interface on February 26th. For details, go to:
http://blogs.adobe.com/rjacquez/2009/02/elearning_session_on_the_new_u.html
Don’t forget to read my review of FrameMaker 9:
http://www.scriptorium.com/palimpsest/2009/02/framemaker-9-review.html
I evaluate the new interface (complete with screen shots) and features such as importing PDF comments, preserving CMYK in PDFs, new book options (section numbering, for one!), and the Adobe AIR interactive online help.

Review of screen capture programs
February 17th, 2009 by ScriptoriumTech
by Sheila Loring
Matthew Ellison reviews seven screen capture programs: FullShot, HyperSnap, SnagIt, Madcap Capture, RoboScreen Capture, ScreenHunter (free), and TNT. He also points out what to look for in a screen capture tool and compares features in a handy table.
http://www.writersua.com/articles/capturetools/index.html
SnagIt lands at the top of the bunch. Matthew describes it as “the most full-featured of the capture tools reviewed in this article.”
I’m a recent SnagIt convert after using Paint Shop Pro for years. SnagIt can’t be beat for a quick, easy screen shot. I also like the torn edge options to indicate a partial shot of the GUI. But the jagged edges might be more of a creative device than helpful visual cue. What do you think?

Essential tools of an XML workflow in the publishing industry
February 2nd, 2009 by ScriptoriumTech
by Sheila Loring
Communications from DMN provided a link to a webcast on Essential Tools of an XML Workflow. The webcast focuses on the book publishing industry. It’s interesting to hear that some publishing houses still allow authors and editors to use Microsoft Word. These folks are often viewed as incapable of learning an XML authoring tool. Many times the Word content is sent to an indexer for tagging.
The companies I’ve worked with don’t give their employees the choice of publishing tools, but if you’re Stephen King, you probably won’t be forced to use an XML tool.
Technical writers, if you know how to work with XML, your skills are portable to publishing houses. Don’t overlook this in a job search.
http://toc.oreilly.com/2009/01/webcast-video-essential-tools.html

FrameMaker 9 review
February 2nd, 2009 by ScriptoriumTech
by Sheila Loring
FrameMaker 9 has been out for a few weeks, and FrameMaker users having been buzzing about the new interface and features. See what I have to say about it before you upgrade.
Read the review:

How to trim your Twitter feed
January 30th, 2009 by ScriptoriumTech
by Sheila Loring
Receive too many Twitter’s throughout the day? You might not want to receive every @reply I send to my followers, so weed them out by doing this:
Go to http://twitter.com/account/notifications
Change @replies to read “Show me @replies to the people I am following.”
This apparently is the default setting, because it’s already selected in my preferences, and I haven’t touched them.
From Chris Brogan’s blog:
http://www.chrisbrogan.com/unswamp-your-twitter-feed/

A birthday cake for your favorite programmer, not auntie
January 29th, 2009 by ScriptoriumTech
by Sheila Loring
Another reason to use Thunderbird. Someone ordered a birthday cake through email, and the Microsoft Outlook code ended up on the cake. *My* aunts wouldn’t understand, but I know several people who can appreciate the goof.
http://www.labnol.org/internet/microsoft-outlook-ruins-birthday-cake/6824/

Hexadecimal and unicode converter
January 28th, 2009 by ScriptoriumTech
by Sheila Loring
I found a cool utility on the web for converting characters to hexadecimal to unicode characters (UTF-8 and UTF-16). Go to http://rishida.net/scripts/uniview/conversion.php and type the character in the Characters box. Click Convert. The hexadecimal and unicode equivalents are displayed. Decimal codes are also provided along with some other goodies like JavaScript and CSS escapes and HTML codes.

Importing comments into FrameMaker 9 documents
January 27th, 2009 by ScriptoriumTech
by Sheila Loring
FrameMaker 9 boasts this handy feature: you can import notes and comments from PDFs into FrameMaker docs. First, you must save the document as tagged PDF in FrameMaker 9. Then you mark up the PDF, adding notes and deleting or inserting text. Then you select File > Import > PDF Comments and select the file. The insertions and deletions show up as conditional text in the FrameMaker document, with the familiar red strikethrough for deleted text and blue underline for inserted text. The sticky notes are markers in which the marker text shows the sticky note content.
This is all great, thank you Adobe, and so on, but I’m having a problem with the sticky notes. When I edit the marker text in FrameMaker, I expect that updated note to show up in the PDF after I save the file as PDF again. The note doesn’t display at all. I’m wondering now what the point is if you can’t “round-trip” notes. Perhaps it’s user error, but I don’t think so.

Over 20,000 free online books
January 15th, 2009 by ScriptoriumTech
by Sheila Loring
I found the coolest online resource for free books! shop.ebrary.com offers over 20,000 full-text online books in a variety of topics — everything from the humanities to science, technology, and even sheet music. You can search the text, title, subject, author, or publisher and specify the type of publication (book, sheet music, map, journal, etc.) and the language. When you get the list of search results, you click a book title to see the search terms highlighted on the page.
The ebrary reader, a quick DLL installation, lets you add the book to your bookshelf, highlight text, print, copy and paste, navigate, define words, translate, buy the book, search, and bookmark pages. For example, my bookshelf contains a book called Dreams in Myth, Medicine, and Movies. I’ve bookmarked a page that discusses surrealism in Hieronymus Bosch paintings. In the bookshelf, the bookmark shows up as an icon that you click to view the page. The ebrary reader really sets apart this collection of books from Gutenberg and eScholarship.
The ebrary actually includes books you’ll recognize, such as Michael Kay’s XSLT 2.0 Programmer’s Reference (3rd edition — a little old but still relevant) and the Adobe Classroom in a Book series. I even found a book called Opportunities in Technical Writing by Jay Gould.
There’s one catch. You must set up an account with at least $5. That’s to cover copying and pasting or printing that you might choose to do. I think it’s a small price to pay, particularly if you never spend it!
Do you know any free resources for similar books — particularly XML/XSLT books?

What’s happened to my local FrameMaker user’s group?
January 13th, 2009 by ScriptoriumTech
by Sheila Loring
The North Carolina FrameMaker User’s Network (NCFUN), which became a STC Carolina special interest group, formed over a decade ago to support technical writers in the Raleigh/Durham area. Speakers (local and international) presented on everything from complex autonumbering and FrameMaker plugins to single sourcing and using reference pages. We usually had a topic for beginners and more experienced FrameMaker users.
During the meeting, users had the chance to get advice regarding problems in FrameMaker. We also had the obligatory snacks and refreshments — not to be underestimated, especially with one sponsor who always served delicious food.
The group also had its own mailing list. Users emailed questions between meetings, and we also sent meeting announcements to the list.
I write in the past tense because the group no longer meets. Over the years, attendance dwindled to a group of 10 and then 5 and then 3 regulars. It became more difficult to get speakers, and toward the end, the few regulars just talked (also not to be underestimated, but still…). I missed having this intimate FrameMaker resource. At every meeting, I learned something and had good geeky fun.
What happened? Did other FrameMaker resources become easier to use? For example, Framers is a very active mailing list with hundreds (maybe over a thousand) subscribers and anywhere from 5 to 20 messages a day. The group has a mailing list, whose archives are a bit difficult to search but are accessible through Google.
Did the time or location make the meeting difficult to get to, were fewer technical writers using FrameMaker, or did the topics no longer have wide appeal?
I suspect the answer is all of the above. Attendance at other STC Carolina special interest groups has also dwindled, and now none of them meet on a regular basis. Perhaps our members had too many groups to choose from each month and just gave up.
Ironically, now that I’ve been working on some non-FrameMaker projects, I’m less interested in the group, but I’d still like to see what other FrameMaker users are up to. Someone’s always using FrameMaker in a creative way.
Have user groups in other parts of the world gone the way of the NCFUN?

The case of the abused apostrophe
January 7th, 2009 by ScriptoriumTech
by Sheila Loring
How often do you see abused apostrophes — possessives punctuated as plural words and vice versa? We might know the rules, but it’s easy to dash off an email and inadvertently use the wrong punctuation.
Andrea Wenger wrote an article on how to use apostrophes correctly, even when style guides disagree. It’s well worth the read: http://stc-carolina.org/newsletter/tiki-index.php?page=Obsessed+with+Possessives
Do you know what a genitive is? Read Andrea’s article and find out. Fascinating stuff.
Shameless plug: For details on other issues technical writers are most concerned about, check out Scriptorium’s Technical Writing 101: A Real-World Guide to Planning and Writing Technical Documentation. Some topics include writing for an international audience, creating indexes, working with developers, and writing documentation plans.

What I’ve learned as a member of the Society for Technical Communication (STC)
January 6th, 2009 by ScriptoriumTech
by Sheila Loring
I’ve been a member of the STC Carolina chapter for over 10 years. During that time, I’ve volunteered in several positions and met a lot of people. Many of them also volunteered for the chapter. One problem we had was finding new volunteers. Many non-profits, in fact, face this problem. Organizations come up with ideas for programs or events that benefit members, but without volunteers, those ideas sit on the table.
In a way, I couldn’t understand why a technical writer wouldn’t want to volunteer to do SOMETHING. I had fun. It’s the best way to get to know other chapter members, plus I learned a lot professionally and personally. Here are a few examples of things I’ve done and what I’ve gained:
- FrameMaker SIG manager: Made contacts with FrameMaker users, practiced giving presentations, and picked up lots of tips ‘n tricks.
- Competitions committee: Learned how to improve documentation and also discovered what NOT to do. Gained experience managing projects.
- Communications manager: Discovered that sending emails to a 300-member chapter exponentially increases your visibility. People really do read their email.
- Webmaster: Learned how to update web sites in both standard HTML and wiki formats and manage discussion lists. Gained experience juggling tasks, because I was communications manager and webmaster.
- Newsletter production editor: Currently my creative outlet and opportunity to learn how to write and produce publications via a wiki. I also get to do a little editing.
- Vice president: Learned how much work is involved in running a chapter!
- Chapter meeting participant: Learned about all aspects of technical communications — from career development and writing in Simple English to single sourcing and online help development tools. Also learned how to be more outgoing — go up and say “hi” to someone and chat. That was really hard at first but has gotten much easier.
In addition to picking up skills, I met dozens of local technical communicators. These tech writers are not introverted, dull people, contrary to the stereotype. Their hobbies are as diverse as riding motorcycles, whittling wood, and participating in triathlons. Some of us meet every few months at a restaurant or theatre. We call it the STC Ladies Night (but it’s not an official chapter activity like special interest groups). BTW, if you’re a female Carolina chapter member and want to attend, let’s talk.
I can’t say that every minute has been completely enjoyable. Sometimes the newsletter deadline coincides with a personal crisis, and the last thing I want to do is hunt down graphics for articles. And you can’t do everything! You’ll WANT to volunteer to do several small things at once, which then amount to one BIG thing, and then life happens. You have a deadline at work or a family member has surgery. This brings me to the next lesson in volunteering — learning how to say no.
So all in all, my experiences volunteering in the STC Carolina chapter have been very rewarding. I recommend dipping your toes in the water and trying something out. Email or call someone on the administrative council and ask what needs to be done. Start small. You might be surprised at the return on investment.
I’d love to hear about your volunteer experiences. Do share.

How do you manage your RSS feeds?
January 6th, 2009 by ScriptoriumTech
by Sheila Loring
How do you manage your RSS feeds?
Over the past few years, I’ve subscribed to 90 RSS feeds, many of which I (currently) read on a regular basis. Google Reader is my reader of choice. This reader makes it easy to organize feeds by topic. All of my news blogs are grouped under Current Events. Beauty and fashion are filed under Frivolity. Art and photography are considered Self Expression…and so forth.
Some feeds are only skim worthy, while others I read word-for-word. Still, 90 feeds is really more than I can realistically keep up with. The question of which feeds to unsubscribe from plagues me. How long does one subscribe to a feed before deciding it’s not worthwhile?
The decision can be compared to investing, when you drop a mutual fund after a few years of consistent losses. But with feeds, I allot a few months to judge the quality of the content and how often the author contributes. My feeds typically provide practical learning experiences. That sounds boring, but it’s not really. Gryphon Journals addresses topics of interest to technical communicators — analyzing product documentation for usability, improving your writing, and so on. Gen X Finance provides advice for achieving financial independence. I don’t read a lot of fluff, except for the few filed under Frivolity!
Some feeds are from consultants or individuals selling products and services. I weed out those that don’t have substantial content. Gretchen Rubin promotes her book on The Happiness Project, but I always learn something from her posts. (See Tips for getting your sweetheart to do chores—without nagging.) On the other hand, many corporate blogs are pure marketing tools and don’t provide any practical content. I usually don’t bother with those unless I just want to keep up with the company’s activities.
Here are a few more of my daily must-reads:
- I’d Rather Be Writing: Trends in technical communication from Tom Johnson.
- WorkLifeLove: Holly Hoffman’s views on building relationships, working effectively, and basically living life.
- Mind Hacks: Tips for using your brain!
- Conversation Agent: Valeria Maltonis’ advice for optimizing your social network — from marketing with Twitter to writing effective blog entries.
- Dr. Macro’s XML Rants: Eliot Kimber’s lessons learned in XML and associated technologies — XSLT, technical publishing, Java, etc.
- Digital Photography School: Product reviews, lessons in digital photography, and tips for managing and sharing photos.
What are your favorite feeds, and how do you prevent them from getting out of control?

On the Unspoken Rule
December 19th, 2008 by ScriptoriumTech
by Sheila Loring
Ben Minson at Gryphon Mountain Journals published an entry on the unspoken rule — technical writers who don’t read documentation. Admit it, you’ve been guilty from time to time. For me, learning to use a product on my own is an enjoyable challenge, one reason I’m a technical communicator. For a more complex product, such as the digital photo frame I just bought, I like to read the manual from cover to cover (or at least until the English translation ends), particularly if I’m teaching someone (hi, Mom) how to use the product.
When technical instructions are sparse or unclear, several scenarios come to mind:
- The manufacturer doesn’t value thorough documentation.
- As a result, an inexperienced (or non-) technical communicator wrote the documentation.
- The book was poorly translated.
More often, though, I consider the conditions under which documentation is written:
- How long did the writer have to complete the project? Perhaps the development cycle didn’t allow for sufficient screen shots or editing.
- How often did the product change throughout the writing process, so that last minute updates were missed? Even the best laid plans suffer when engineers tweak the interface or add new features one week before publication.
- Do you find evidence of inadequate single sourcing, for example, references to “earlier in this chapter”? The writer perhaps didn’t have enough time to yank such references from an inherited book.
- Were the wrong tools used — Word instead of FrameMaker? (Word might be fine for short, simple documents, not long, graphics-intensive technical documents.)
Most of these challenges are beyond our control. Even so, as writers, we might cringe at the thought of other writers consuming the fruits of our labor.
I’m not saying we shouldn’t strive for complete, consistently formatted documentation. This is the goal of conscientious technical communicators even under the worst of circumstances. I’m saying sometimes the writer isn’t completely at fault.

Unstructured Documents in Structured FrameMaker
December 9th, 2008 by ScriptoriumTech
by Sheila Loring
A few days ago, there was a thread on the Framers mailing list regarding working in the structured FrameMaker environment. Someone commented that editing unstructured documents in the structured interface does not affect the unstructured documents. I found this to be untrue recently when FrameMaker 8 crashed. I had about 30 unstructured FrameMaker documents open in the structured interface — only because I’d previously been working in a structured document. When I restarted FrameMaker after the crash and opened a recovered file, FrameMaker displayed an error that the file contained structured information. Great. FrameMaker saved some kind of structured information in the file during the recovery effort.
We all know that crashes in FrameMaker are not infrequent, so this could easily happen to you. Save yourself a painful lesson and work in the structured interface only when necessary.

Shrinkwrapping Equations in FrameMaker
December 8th, 2008 by ScriptoriumTech
by Sheila Loring
Recently, I created several complex equations in FrameMaker 8/Windows XP Pro. I was very proud of myself for faithfully recreating the original equations imported as graphics from Word. When I went back to check my work, however, the equations were all wrong. Some of the characters had changed to question marks. My bubble suddenly burst! I knew I’d inserted the correct symbols! I had no idea what they meant, but they did look right.
Luckily, an Adobe knowledge base article (http://kb.adobe.com/selfservice/viewContent.do?externalId=319076&sliceId=2) addressed the problem in an older version of FrameMaker on the Mac. Characters displayed incorrectly after I shrunkwrapped the equation. After resizing the anchored frame a bit, the correct characters returned. The article also explained that this was strictly a display issue, and the equations would print correctly. Whew! My bubble quickly returned to its intact state.

A Day in the Life….
August 27th, 2008 by ScriptoriumTech
by Sheila Loring
Here’s an inside peek at a day in the life of a Scriptorium employee:
But wait, there’s more…a t-shirt!
Ladies, check out the regular expression skirt, too. The code is printed upside down for easy reading while sitting.

Doctrain: In.vision Xpress Author for Microsoft Word
May 8th, 2008 by ScriptoriumTech
by Sheila Loring
Michael Boses, CTO
Invision Research (www.invisionresearch.com)
Yes, Virginia, you can author XML in Word.
Company founded in 1996. Xpress Author for Word launched in 2002. Early markets were defense intelligence, central government, and pharmaceutical industry.
Focused on enabling Web 2.0 technologies with structured authoring.
In.vision provides structured authoring interface inside Word. Toolbars configured for XML are displayed. Menus are also tailored for XML authoring. Based on the cursor location in the document, only valid items show up in the menu. So you can’t insert a figure where only a title is valid.
Authoring based on styles. The styles drop-down list shows valid elements based on where you’ve clicked in the document.
Templates available. For example, create a document using the text book template, [Type title here] is displayed where you’ll type the chapter title, placeholders for objectives, review questions, further reading are also displayed. Boilerplate text can also be saved in the template.
You can copy and paste content (including graphics) from other types of documents — standard Word documents, spreadsheet, browser. Even a table with merged cells can be copied and pasted into structured doc. Structure is automatically applied.
Word’s beloved revision tracking and commenting features are supported. This information is stored as metadata in the document, and you can specify the type of comment based on your defined structured (such as a legal or draft comment).
Spell checking and thesaurus are also supported.
You can display a document map, which shows headings in the document. Click on headings to navigate through document.
What happens when you paste content into an invalid location? When you right click to paste the content in an invalid location, the drop-down menu Paste item is grayed out. If you Ctrl-V to paste, the cursor flashes to indicate illegal action (too subtle an indication for my tastes), or you can configure the program to display a friendly dialog box.
How do you see the XML tags? I found Michael’s response entertaining: We have a promise to the users never to show the tags. Not the answer I had hoped for!!!!! You can, however, export the document to XML and then display and/or edit the file in an XML authoring tool (Oxygen and XmetaL are my personal favorites, BTW).
Unfortunately the demo ran short, so we didn’t get to see much else, which is unfortunate. The web site (http://www.invisionresearch.com/xpress.html) doesn’t offer much detailed information, but do check out the brochure: http://www.invisionresearch.com/pdfs/xpressauthor.pdf. I have to chuckle at the front page of the brochure. Envision a suited gentleman with white hair and furrowed brow. “If you’re frustrated with XML authoring, we have a Word for you.” The photo says it all.
Based on what I saw, Xpress Author might be a good alternative for technically challenged content providers or companies married to Word. It isn’t, and doesn’t pretend to be, a heavy-duty XML authoring tool.

Doctrain: Documentation Planning and Library Design in a Web 2.0 World
May 8th, 2008 by ScriptoriumTech
by Sheila Loring
Nicoletta Bleiel
Senior Information Developer
ComponentOne
Quick summary: Web 2.0 technology can enhance technical documentation, but the disorganization and narrow focus make it a poor replacement for sound technical documentation.
Wikis, podcasts, community forums, widgets/gadgets, blog, and social network sites such as Facebook and MySpace are all examples of Web 2.0. Unless you’ve been living under a rock for the past 5 years, you already know this!
The bottom line is always giving your customer what they need. Nicky provided a refresher course in product and user analysis: scheduling site visits to watch the customer use your product, reading the customer forum on your company’s web site, asking tech support for common customer problems, and so on.
Find out how users interact with your product. Do they have the time or inclination to flip through a book, or do they prefer short training videos or cheat sheets? Are they comfortable navigating online forums to find help, or do they prefer online help? Do they have an Internet connection to access online forums or content? Customers with lots of feedback are more apt to participate in forums or blogs, particularly those who are passionate about the product.
Advantages of wikis: users contribute at their convenience, way to build communities of practice (COP), keeper of group memory.
Disadvantages: wiki syntax not intuitive, can be difficult to learn, malicious comments require censorship or some response, often unstructured and disorganized.
Check out this wiki: http://wikihow.com (collaborative attempt to build the world’s largest, highest quality how-to manual, everything from surviving a riot (http://www.wikihow.com/Survive-a-Riot) to crocheting a cat hat (http://www.wikihow.com/Crochet-a-Cat-Hat).
Advantages of podcasts: informal, very useful for certain audiences, easy to create, free or inexpensive, easy to find on iTunes, keywords in podcast increase findability.
Disadvantages: might not work for your customer (computer with no soundcard), need to generate material frequently.
Notable podcasts/podcast tools: talkshoe.com (live shows on web kind of like Webex, recorded as podcast), audacity.com (free podcast recording software), Odiogo at http://www.odiogo.com (text-to-speech converter)
Advantages of blogs: can optimize search engine results, used to promote/educate customers about product features, casual way to communicate with users.
Disadvantages: require frequent contributions or lose audience, must respond to feedback.
In summary:
Always maintain solid user assistance as a foundation.
Web 2.0 enhances but doesn’t replace what we do.
See the slides at http://www.doctrain.com/west/program_detail/documentation_planning_and_library_design_in_a_web_20_world/.

CMS/DITA Conference Session 14: Implementing DITA in the Real-World
April 10th, 2008 by ScriptoriumTech
by David Kelly
This session was presented by Raymond Yang Lei, a senior information architect with Huawei (a large telecommunications company in China). Mr. Lei apologized for his English, this being his first opportunity to use it in a public forum. And, in truth, there were some details that probably got lost in translation, although JoAnn Hackos was present to help clarify points of confusion.
(I believe most of the bilingual Chinese people in the Bay Area at that time were gathered in downtown San Francisco, engaged in other business.)
Mr. Lei’s story of a DITA migration was similar in many respects to the other stories we heard during this conference. Drivers for the migration included rapid growth of products, versions, and customers, and requirements from their customers for world-class documentation quality. (I formerly worked for a company that was occasionally in direct competition with Huawei. Telecommunications is heavily regulated, and while the actual writing quality may not be at issue, the format, timeliness, and accessibility of technical data can be a deal breaker.)
Reasons for choosing DITA were similar to others mentioned in previous posts. One interesting comment he made: one reason for choosing DITA was the growing user community. I heard, informally, from other people that they were initially skeptical of DITA simply because it was open source, open standard, and those kinds of projects were typically ephemeral. I believe there is a growing sense that DITA is likely to have greater longevity and penetration than was initially suspected.
Okay, I need to get to the airport soon so I’m going to wrap this up quickly – will try to repost if more occurs to me during the long flight. So here are some quick points:
* They used an Excel spreadsheet to identify subject/topic completeness and organization. Later this spreadsheet outline was used, via an external application they developed, to generate a ditamap and create links to the appropriate topics. Very cool.
* They use Clearcase to store and version content. It’s not a CMS, I’m here to tell you.
* Changing roles include the fact that the Information Architect and writers must now be more aware of a broader scope of product knowledge in order to structure data and reuse data appropriately.
* They evaluated all the CMSes they could and rejected them all. Reasons: Unicode support does not mean Chinese-ready; not all the CMSes fully supported DITA; CMSes are expensive, particularly in China, and the ROI does not justify it.
* Their localization consists of writing in Chinese and translating almost simultaneously to English, then publishing both simultaneously. Eventually they plan to write primarily in English.
There were a couple of other interesting points — but I’m out of here. Cheers!

CMS/DITA Conference Session 13: From Unstructured to Structured, From HTML/Frame to XML: A Case Study
April 10th, 2008 by ScriptoriumTech
by David Kelly
This session was presented by Padma Nappalli from Intel Corporation and described her documentation department’s experience migrating from a standard Frame/HTML environment to a DITA/CMS environment.
Many of the situations, frustrations, and successes that their team encountered sounded similar to those of other migration stories I heard at this conference (and that I have also experienced). Publishing requirements multiply, customizations are needed in output, multiple legacy formats are difficult to maintain, quality and timetables suffer, customers complain.
Ms. Nappalli’s department began researching XML solutions in 2005 and are now in production using DITA and Documentum. Documentum was already in-house, and they had little to no budget, so that’s what they use.
They discovered that with DITA they did not need a CMS right away. It was more important to structure the source material. They now have 6000+ topics in DITA, and they are able to supply man pages from DITA as well as PDFs, CHM, and HTML output. She said they had to develop their own method of outputting man pages from DITA as they could not find any solutions available.
Like others, they have seen a shift in roles, not only individually but departmentally. Two examples: one person was hired “as a writer,” knowing that he would become their publishing tools person. And they now have a CMS admin, so the department has taken on an embedded IT function.
A few unique points from her presentation:
* They still need to use Robohelp and FAR (not sure what this is) for helps, but it is last-minute production that takes far less time than before.
* They trained a single writer to rework the legacy data and do the conversion, to reduce overhead. Here is where the comment came out that she believed it was beneficial to do source tweaking before conversion.
* She says “It is not possible to use DITA out of the box.” So plan on tools development.
* They had the “blessings” of management, but no funding, no IT, and no tools development support. In spite of this, they apparently managed to obtain all of these things eventually, one way or another. To this point she told the traditional story of “The Arab and the Camel.” Although I’d heard it many years ago, I’d forgotten it, so I’ll repeat it here:
Out in the desert there was an Arab with a tent, and his camel waiting outside. A tremendous rain started coming down. The camel came over to the tent and asked if it could just poke its head inside to keep it dry. The Arab agreed, yes, that would not take up much room. Then the camel asked if it could bring its neck in, and the Arab agreed. Next came the shoulders, then the trunk and legs, then the tail, and soon the Arab found himself sitting in the rain.
I believe that is called an incrementalist strategy. Apparently it also works outside of folk tales.
Cautions that Ms. Neppalli passed along:
* Do thorough content analysis (I’m sensing a theme here)
* Think big, start small
* Don’t rush to conversion
* Start with highly structured content
* Don’t rush to buy tools (although she says the tools are much better now than when they started). She said their biggest challenges was in the tools.
* “Sandbag Your Plans.” Expect the unexpected. There will be surprises.
As I sit here typing in my hotel room, one of the 150,000 or so people bumped from an American Airlines flight, I wholeheartedly agree.

CMS/DITA Conference Session 12: Business Drivers for Changing to DITA
April 10th, 2008 by ScriptoriumTech
by David Kelly
This session was presented by Bruce Nevin, Program Manager and Information Architect for Cisco. His purpose was to present some of the drivers for moving to DITA. The raw data for the presentation came from Cisco’s own migration to DITA, which he called “an adventure.”
Initially they had developed their own information model after having evaluated and rejected DITA while it was still in its early stages. At a certain point the proprietary model became problematic, so they re-evaluated DITA. He stressed that the reasons for choosing DITA in the second round were more business reasons than technical. In a comparison of the technical aspects of DITA and the proprietary data model, the main differentiator was specialization, which, while valuable, was not a compelling reason in and of itself.
As an Information Architect and an academic linguist, Mr. Nevin strongly recommended performing a content analysis to specify the structure of you content. This leads to “discovery” – identification of inconsistencies – and “opportunities” – to make structures consistent. I would think one could also identify latent structures and find ways to make them explicit during this phase.
Some of the criteria that led them to select DITA included:
* Ease of sharing of data with their corporate acquisitions
* Long-term supportability (apparently a big issue with the proprietary data model – the IT department had problems with it)
* Vendors charge less for translating DITA (this, he admitted, was a rumor he had heard)
* Tool vendors can preconfigure for DITA.
Some challenges for DITA:
* It’s immature
* Industry standard specializations are needed
* It is based on XHTML format tags (for instance, he believes that a table and a list are the same thing and that the distinction is unnecessary legacy baggage)
* Filled with IBM legacy information (see audience comment in the session on the troubleshooting specialization)
* It is limited to DTD spec, even in the schema (not sure I understood this point)
The costs of going to DITA could be managed by a few helpful practices he mentioned:
* Cost: rewriting XSL. So delay specializations as long as possible.
* Cost: conversion. So use a quick and dirty, vanilla conversion as long as it renders okay.
* Cost: rendering. Leverage the Open Toolkit code as far as possible. But it will still require a lot of work. (I am prepared to sign an affidavit to support this statement.)
* Cost: customizations. So leverage vendors’ code, integrate with Open Toolkit. One example was Cisco’s command syntax for its network OS. They have a tool they built to integrate with XMetal that provides a pop-up from which you select human-readable elements of the syntax, and it populates the DITA file with the correct tags.
Of these, he considered specialization and XML cleanup deferrable processes in the migration process, but rendering and customization to be nondeferrable.
Management issues they encountered:
* Disparate priorities among the groups (limit size and complexity, get easy wins, try to maximize ROI as early as possible)
* Minimizing disruption to production (start with easy projects, avoid having writers switch between legacy work and DITA work, avoid round-tripping content – migrate only one direction)
It took Cisco 2-1/2 to 3 years for their migration effort.
A discussion arose based on an audience question. What factors would drive the migration timeline? Bruce thought it was a good question, he just had not thought it through yet. Some answers from both him and the audience:
* Money
* Time commitment
* Taking time to do research and make good decisions.
* Content analysis (familiar chorus). “Language is structured, XML makes it explicit.”
* Starting with well-structured writing “gets you 70% there” (Information Mapping mentioned)
* The need to migrate from disparate tools/processes
* Conversion while producing. Recommended incremental rollouts rather than mass conversion of all documents.
* Size of company is a variable.
* Avoid the pilot project also being a production project – “You don’t know what will happen.” But some audience members said they did not want to waste effort, so they picked projects that were only loosely time constrained or that were small enough to permit parallel development. One person said they also went for a project that had enough wow factor to get management’s attention.

CMS/DITA Conference Session 11: Selecting a Content Management System of DITA
April 10th, 2008 by ScriptoriumTech
by David Kelly
This session was presented by Scott Wolff of WOLFF & Associates, LLC. From my perspective, other than the sessions that presented testimonials of successful DITA/CMS conversions, this was one of the more on-target sessions I heard.
The substance of this presentation was to identify several DITA-relevant CMS functions, then show how a given CMS handled that function successfully. The intent was not to compare all CMSes per relevant function, but to provide some tools for undertaking that kind of evaluation oneself. And, of course, when asked whether he had come to any conclusions, Mr. Wolff replied, “Yes, and I’m available to be hired for consultation.” Or you can start with his suggestions and do the work yourself.
Again, I won’t rehash everything he said, but will focus on some interesting points:
* Eight vendors provided screen shots for his talk.
* CMSes can run anywhere from 10K (“But you won’t like it”) to $1M – $7M. All of the CMSes he presented could be had for less than $200K.
* DITA does not require a CMS because relationships between elements are tracked internally (links, conrefs, topicrefs). But CMSes provide other, “extra-DITA” values to content, including version control, where-used functions, workflow management, security, and asset protection (backups).
CMS functions he thought important, and the systems that did them well:
* Drag’n'drop ditamap editor (Vasont)
* In-system editing of DITA attributes (Vasont)
* Import new DTD to reconfigure the data model (and update the catalog) (Trisoft)
* Support for conrefs (usually difficult because CMSes typically do this) (Trisoft)
* Allow choice of a particular version of a topic for a map to point to (Trisoft)
* Import ditamaps and bring in the target topics, maintaining the links inside the CMS (Empolis)
* Where-used (where is a topic used) function (Empolis)
* What-uses (what uses this topic) function (Empolis)
* IT-friendliness (DITA Exchange, an extension of Sharepoint that gives it DITA capability)
* Word-like editor (DITA Exchange)
* Publish to Word 2007 (DITA Exchange)
* Dynamic folder structure (DITA Exchange – you’ll have to talk to Scott to see what he means by this one)
* Edit relationship tables (Ixiasoft DITA CMS Framework)
* Define conref by beginning with target, then finding insertion points (Ixiasoft)
* Integration with editor (Ixiasoft/XMetal)
* Diffing on ditamaps (Ixiasoft)
* Lease time on server rather than buy CMS (Astoria)
* Return search results as a ditamap (Astoria)
* Conditional publishing, filtering based on production status at the last minute (Astoria)
* Conditional publishing a different way (XyEnterprises Contenta)
* Maps and ditavals dynamically created (not sure how this works…) (XyEnterprises)
* Creation of an object that associates document resources into an object for document creation (Inmedius Horizon) (Does that sound like the definition of a ditamap? I’m curious how they are different.)
I should note that these functions and the systems are not a comprehensive list of what should be evaluated in a CMS. Several people, including Scott, mentioned JoAnn Hackos’ book on CMSes as being very helpful.
Also, I believe Scott would agree that his mention of one system for any given function did not mean that other systems would not also have good (or acceptable) solutions for the same problem.
Some warning signs that Scott proposes when evaluating CMSes:
* Solutions not seamless with DITA – “DITA-compatible” because they export DITA does not mean they store DITA XML – this can lead to configuration issues when changing data structures.
* CMS does not track DITA components – therefore it can’t track relationships between components.
He suggests that with a CMS will come new roles. This was echoed in several other presentations – new technologies mean new roles for writers and editors.
Maybe even managers? Can managers be retrained? Do I smell a possibility for a new presentation?
Finally, in response to a question as to whether there was an open source CMS that tracked DITA components, one of the audience members brought up “Componize,” a CMS based on Alfresco (another open source CMS) that would provide DITA component-level management. It is apparently available on Sourceforge. I have no other information to vouch for its appropriateness as a DITA/CMS solution. So go have fun – just don’t blame me!

CMS/DITA Conference Session 10: The Ins and Outs of Content Migration
April 10th, 2008 by ScriptoriumTech
by David Kelly
This session was presented by Joe Gollner, VP of e-Publishing Solutions for Stilo International.
When I first saw a “Stilo” case being carried in for their booth, I thought they might be associated with the Stilo Fountain Pen store I had seen just the day before in San Francisco. As someone who works for Scriptorium and who writes for a blog called “Palimpsest” I should have known better. Stilo does document conversions, among other things.
Slight diversion here. When I ran into my networking glitch early in the conference, I left my laptop in the hotel room and decided to just make notes in my conference proceedings book. I used a Cross ATX fountain pen, which I had to refill quite often. Then I took the proceedings back to the room, where I transcribed, distilled, commented, edited, rewrote, posted, and reposted to my little heart’s content. The irony of using what Mr. Gollner would call an “opaque” documentation method while sitting in a room full of extreme high-tech documentation practitioners – well, I’m not stupid. (Well, maybe I am, but not like that.) But I found the method curiously effective. When I hand-write, I focus on what is being said better, and, whether by muscle memory or (as research suggests) the immediate translation of auditory speech to written word, I retain more of what I hear. And the subsequent transcription process gives me the opportunity to…well, see above. It just took a lot more time than expected. And of course, now you have to endure these lengthy soliloquies.
Okay, back to the conference:
The central point to Mr. Gollner’s presentation was summed up, I think, by his reference to content as an Easter Bunny. You think you have nice content, nice Easter Bunny. Then he showed a slide with a monster bunny and the words “The Easter Bunny Hates You.” The Easter Bunny, it turns out, is full of surprises. (Look up “The Easter Bunny Hates You” on the web.)
This message was consistent with what I heard in several other sessions: data analysis is key to a successful conversion project. Even plain text has structure. A good conversion project should be able to capture as much as possible of the intelligence expressed in the source.
In speaking about the conversion of documents for a large pharmaceutical lookup portal, the qualities that ideally should be captured in the resulting text, according to Mr. Gollner, include:
Precision
Performance
Navigability
Portability
Timeliness
In general, the ideal criteria for documents before being converted to DITA include:
* valid XML
* modularized
* discretely addressable
* uniquely identifiable via metadata
* linked appropriately
* processable with “almost perfect confidence”
And of course, no one’s documentation looks like that before conversion. Instead, he defines a “spectrum” of document accessibility to conversion:
* Opaque (paper, fiche)
* Annoying (proprietary formats like Quark)
* Polluted (old HTML, things filled with deviations and noise)
* Tolerable (a documented format that exposes format and structure in processable form – to which category he grudgingly admits MS Word and Framemaker)
Against this challenge, his company has devised a process flow that they are able to use in almost all cases. Given that the slides were inscrutably dense, I don’t propose to discuss them. Works for them. They do conversions on the order of hundreds of millions of pages, so I won’t argue. And I imagine the results are of very high quality.
One point raised by your humble correspondent had what I would consider an arguable answer. The approach typically taken by Stilo is to automate all the conversion they possibly can, then do tweaking after conversion. I asked whether there was ever a situation where it might be preferable to do data tweaking before conversion. This is a question I have struggled with, without any helpful metrics, in my own conversions of hundreds or thousands of pages. Mr. Gollner’s answer was that it was always preferable to do the tweaking afterwards. Of course, he also later said the folks in his office would write a conversion script for a single page if given the opportunity.
However, interestingly, this morning I heard a session in which Padma Neppalli of Intel stated that she believed it was beneficial for them to do restructuring and tweaking of the documentation prior to it being converted. Again, without metrics, and at the risk of playing “he said, she said, did so, did not” — it does seem to me that there are opportunities for reducing conversion pains by doing some basic grooming of the source before conversion. It’s sort of like running a dust brush over your vinyl discs before converting them to MP3s (ouch, did I just date myself?). Yes, you can do cleanup of pops and scratches in the digital world, and very quickly from what I’ve seen, but if you have a more convenient method in the old “analog” world to give you a cleaner conversion to begin with — why not? It seems risky to lose information just to put it back later, if you can avoid it.

CMS/DITA Conference Session 9: Putting the M in CMS
April 10th, 2008 by ScriptoriumTech
by David Kelly
This session was presented by Harv Greenberg, Principle Technical Project Manager from XyEnterprises. This presentation focused on the used of mechanisms within the CMS (XyEnterprise’s Contenta CMS, specifically) to develop management tools.
I think the purpose here was to show that CMSs contain a lot of information and capabilities that can be built around content as a way to enhance the value that management brings to a documentation project. Some of the possibilities presented were “well-defined project formation” (similar to the methodologies built into the S1000D specification), and the development of tools for “information quality” (term checking, customer feedback, knowledge base integration, workflow metrics).
The session featured a lot of participation by the audience in suggesting types of management capabilities that would be nice to have, and demos by Harv of Contenta’s capabilities along those lines. One interesting request he said they had had was the use of the CMS as an invoicing system. With a little tweaking they were able to capture hours worked for each portion of built-in workflows and create reports capturing the task/resource information.
I’m not sure I learned a lot with this one other than to try to think about other possibilities for the use of CMSes. And that makes sense – with all the information captured in these systems, there should be a lot of possibilities for doing things with documentation that we have not necessarily considered before.
One thing mentioned in one of the other sessions (maybe it was JoAnn’s) was that the accelerated interest in DITA had renewed people’s interest in CMSes. Perhaps with a stronger adoption rate for CMSes, this will lead to more people thinking about how they can get more value out of these (very expensive) resources.

CMS/DITA Conference Session 8: Improving Source and Translation Quality with DITA and Content Management
April 10th, 2008 by ScriptoriumTech
by David Kelly
This session was presented by Ann Adams of Kyocera and Dan Dube of DocZone.com. The gist of the presentation was similar to several others at this conference, with a little different twist in the type of CMS that was selected. The story was similar to the pattern of others: a limited writing staff and diverse legacy sources as over against an exploding complexity of products, audiences, and document services/output types to fulfill.
Ms. Adams pointed to several keys to success and several points where they learned difficult lessons. Some of the things she thought helped:
* Attending a DITA User’s Group early on.
* XML training for authors. (When the XSL section of a video started, she turned it off – no need to confuse people.)
* DITA training
* A workshop on modular writing.
The CMS they chose, DocZone, helped them accelerate the process because it is a hosted solution – they contract for services rather than purchasing the CMS and related tools. DocZone helped them perform the data structure analysis and performed the initial conversion for them.
One point she made: they have continued to tweak the data structure as they have learned more about the structure of their legacy data. This was a common theme in the implementation stories I heard: take time to understand the structure of your data. More on this in another session.
Other contributions to success:
* Weekly meetings with the writers (“code reviews”)
* Weekely meetings with their vendor
* Using CMS workflow for reviews
Issues they encountered:
* Turnover in a smalll doc department causes significant loss of intellectual capital. One writer left, another writer was out six weeks on jury duty.
* The first conversion was easy, but subsequent documents uncovered more structure. They wound up changing their minds about the data structure so often they wound up purchasing the conversion software so they could reconfigure it at will rather than paying the conversion vendor each time they wanted a change.
Some interesting “side-effects”:
* The CMS provided high visibility into authors’ writing habits (e.g., making numerous corrections to a document over time creates a large version trail) Some writers did not like this.
* The ditamaps were good for book-oriented views, but writers wanted submaps showing only the topics they were working on.
* The concept of “done” is having to be redefined. Consequently it is hard to make estimates of when they will be “done.”
Two advantages they have derived from implementing the CMS:
* Expanded capabilities (new delivery outputs and methods)
* The use of “author memory” has enabled them to reuse source language more effectively.
What is “author memory?” I hadn’t heard that term before. Ms. Adams said that it was “like translation memory,” then introduced Dan Dube, who is, I believe, a product manager with DocZone, who picked up from there.
It turned out that “author memory” was essentially the same as translation memory, but in DocZone it is built in to the CMS. So is translation memory, for that matter. Author memory holds the source material, while translation memory holds the translated material. Except, if I recall the statement correctly, in the case of DocZone, the author memory and translation memory are the same.
At any rate, the “author memory” approach helps the author make fuzzy matches to similar language and decide whether it is viable to use existing language. According to Ms. Adams, this has resulted in significantly more reuse of content than they expected.
There was a discussion about the adoption of DocZone’s version of translation memory vs. traditional (e.g., Trados) mechanisms. Mr. Dube said translation vendors were wary, but were also interested because it enabled them to avoid paying Trados licensing fees to SDL, which also does translations in addition to now owning Trados. Interesting.
The CMS/DITA implementation for Kyocera took about 1 year. This sounded significantly less than the figures I heard for “owned systems”. On the other hand, Kyocera is also now locked in to regular payments — so I guess it depends on what you are looking for and what kind of cost model you are willing to swallow. And, I suppose, how customizable you want your system to be.

CMS/DITA Conference Session 7: The DITA Troubleshooting Specialization
April 8th, 2008 by ScriptoriumTech
by David Kelly
This session was presented by Carolyn Inkster (project manager) and Dan Dionne (technical lead) from IBM. They were responsible for the development of the troubleshooting specialization that has been placed in the open-source keeping of OASIS. It is currently available as a separate download for integration with DITA-OT 1.4.
The need arose based on the variety of problem/resolution formats across IBM’s documentation set. More than once I heard that at one time IBM was the second largest publisher in the world, right after the US Government Publishing Office (or whatever it’s called). They did a business case and determined that clear, consistent formatting would save writers’ time and customer support time sufficiently to make the development costs worthwhile.
Development consisted of two months of part-time haggling over the architecture among the IBM team, and a few days of development for the DTD/mod files and the XSL for the toolkit.
Development went through the rigorous IBM process required to develop specializations. Boy, I sure love working for a small company!
The demoed pages and the description of the tagging looked like it would work for a basic, flat problem/resolution scenario. The XSLT provides standard labels for some of the section types, where for other section type(s?) there is a “label” tag that allows customized labeling.
For more complex decision-tree troubleshooting it would not work. Apparently this is Phase 2, currenlty under development, and Dan Dionne looked considerably more excited by the technical challenges in this phase than in the first phase. As he put it, “Specialization is absurdly easy.” Of course, he then followed this up with “Specializing DITA attributes is a shaky business.”
Hmm. Having just done same for a customer recently, I began to feel a little shaky.
One audience member opined that it would have been nice if IBM had removed the IBM-specific terms from the specialization before donating it to OASIS. The response was along the lines of “that effort did not fit into their business justification and they would applaud the contribution of anyone who wanted to expend the effort to make further sub- or up-specializations and donate it to OASIS.”
Well, sure, why not. It’s “absurdly easy.”
And that’s about it for tonight’s entries. I’ll try to catch up on today’s other four sessions tomorrow afternoon when the sessions have ended.

CMS/DITA Conference: Session 6, Staged Production and Quality Documentation within a DITA-Based CMS
April 8th, 2008 by ScriptoriumTech
by David Kelly
Presented by Keith Schengili-Roberts from AMD, Inc., this session told the story of a successful implementation of a DITA/CMS solution. And that’s always a good kind of story to hear.
I don’t think there was anything unusual about the problem they were trying to solve. One thing it had in common with all the successful implementations I’ve heard about is that they had to translate their documentation into a lot of different languages — 25 or so. At lunch I asked the folks sitting at my table whether they were aware of any CMS implementations that had been justified by anything other than localization savings. No takers.
One analysis presented was why the addition of more writers was not going to solve the problem. I don’t have all the details of the analysis at hand, and this is a blog, where brevity is a virtue (though I do love to write in Russian novelist mode), so you’ll have to take his word for it. It doesn’t.
Some keys to their success:
* Meeting customers, getting their requirements for documents and delivery.
* Buy-in from management
* Working with the writers on the cognitive change to topic-based writing (training, etc.).
* Providing time for the writers to handle the extra overhead.
* Clear definition of roles. Engineers became responsible for correctness and completion of content, writers became responsible for clearness and consistency of content. To this point he mentioned that DITA allows tagging in multiple ways; editors are needed for tagging style.
He enumerated a large number of advantages that DITA provided, which I won’t repeat here. They chose Ixiasoft for the CMS, who later productized the customer DITA solution they developed for AMD (hope AMD got a price break). He did not mention they reasons for choosing Ixiasoft as far as I recall.
For an editor he began with the statement, “WYSIWYGs are detrimental to DITA.” This brought a rousing cheer of approval from Bruce Nevin, a Cisco Information Architect who was sitting right behind me. Keith said they chose oXygen, which I should have cheered since that is my own personal XML editor of choice. But then I do a lot of XSL development and I’ve always thought it was a little geeky for the average tech writer. Maybe Keith’s tech writers are like the folks in Lake Woebegon, Minn., who Garrison Keilor claims are all “above average.” Although when they had WYSIWYG editors, Keith said they were spending as much as 50% of their time on formatting. Whew, and I thought I was picky.
Keith presented several statistics to substantiate his claims of success. He was able to do this by relying on the reporting tools from the CMS, which he stated were significant aids to management in general. The statistics he presented were, for the most part, fairly abstract, but a couple stuck. One was the savings of 10% to 13% in localization costs. He admitted this was conservative, and it is indeed low compared to other claims I have heard, but he had included the purchase of Unicode fonts (Unicode in general was a big help) and the original conversion costs.
u
One success that he did not expect was that they doubled their output and halved the median project time. The localization savings were expected, but not the development savings.
In general this was a clear, astute, nuts and bolts presentation that gave a lot of information about the inner workings of a CMS/DITA implementation. Mr. Schengili-Roberts is obviously fond of metrics and objective data, so I suspect his claims were not exaggerated.

CMS/DITA Session 5: The Emergence of Dynamic Documents
April 8th, 2008 by ScriptoriumTech
by David Kelly
Originally planned to be presented by Paul Wlodarczyk, this presentation was given by Amber Swope of Just Systems.
This session presented the possibility of structured documents that support a dynamic data link (I don’t think I mean that in the traditional sense) between the authoring/presentation layer and an underlying database. The purpose is to make documents “smarter” by allowing more interaction between the data and the user. One example would be something like a drop-down list inside a certain tag (say, a specialized market-segment
In general this appeared to be a presentation of a solution that would need to be heavily customized by a vendor. It could be implemented for DITA or any other XML/structured data. But the concept certainly provides stimulus for thinking of a lot of possibilities. It’s late now and I don’t think I can think of any at the moment.
More tomorrow.

CMS/DITA Conference Session 4: 7 Reuse Strategies for DITA
April 8th, 2008 by ScriptoriumTech
by David Kelly
This session was presented by France Baril from Ixiasoft. I think by this time in the afternoon I was starting to need a cheeseburger fix – and not a fast food joint in sight. For each point Ms. Baril presented what it was, when to use it, how to do it, and what to watch out for. Here I will only give a brief description of the “whats” in hopes that the curious will ping me (or her) about the whys and hows.
The 7 types of re-use enabled by DITA that she presented were:
1. Multiple output types from one source. XML to PDF, HTML, etc.
2. Using the same topic in different projects and contexts. Basically, multiple DITA maps can point to the same topic, but in different orders or with different selections of companion topics.
3. Regular conditional text/filtering. This is the standard DITA mechanism of specifying an attribute and value on a topic (or other tags), then including or excluding that text based on a setting in a ditaval file.
4. “Systematic” conditional text/filtering. I was not familiar with this term. I somewhat lost the definition, but it appears to involve the use of DITA-specialized elements combined with custom XSL. The example was the use of a
5. Conrefs – this involved the standard DITA use of conrefs, which basically involve creating a piece of text in one place, then “conreffing” it from another. In the final document the conref is replaced by the reference text. See my comment in an earlier entry about thinking about conrefs as contextless, stand-alone units of information. (Hmm, maybe if this blogging application application supported DITA I could have used a conref!)
6. Variables for words and phrases. She mentioned the
7. Automatic content creation. This one involves using some kind of processing system to essentially “reap” the source one or more times to create a secondary document. Some examples were references to Doxygen and Javascript, which make selections from programming source code to create documentation. She also mentioned the example of creating a table of contents in a Word document. And in fact I have written XSL to generate entire chapters that consist of, essentially, linked headings to descriptions of software commands, which are organized according to different criteria in each chapter. One source, many outputs, but in this case they are in the same format in the same document. And a host of other possibilities are out there…

CMS/DITA Conference Session 3: A Democratic Approach to DITA/CMS Issues
April 7th, 2008 by ScriptoriumTech
by David Kelly
This session was presented by Sarah-Beth Doner and Laura Hood of RIM (the Blackberry people). I earnest beg indulgence not to present everything they talked about since there were two of them and they covered quite a bit of ground. Enthusiastically and authoritatively, I might add.
The central theme here was the culture shift required to get buy-in for a major change from being a Framemaker shop to becoming a CMS/DITA shop. Initially they made the change with a small committee selected from the first adopters, and after a small rollout project they began a larger rollout. Here they encountered pushback. Their solutions involved setting up groups and meetings to get broader involvement in the rollout and implementation throughout the entire user community. Apparently they had strong management support to enable the users to spend time on addressing the issues. As mentioned in a previous entry, they also found it necessary to begin redefining roles. Some roles they mentioned were mentor, support, tool development, and content enhancement.
Asked whether the democratic approach had resulted in any decisions that in retrospect they might regret, they agreed that there were instances where they made the wrong decisions, where tools were developed that they discarded, etc. But they found that it was important to keep everyone involved and informed to be able to make the system work – otherwise, apparently, the pushback was difficult to deal with.
Going forward, they are now at the point where enhancements and automation are the big pushes – the basics appear to be solved. (I believe they said they had been doing this for two and a half years.)
The benefits were mostly in reduced translation costs, the ability to scale from 8 language to 40 languages, the ability to customize documents per customer, and the quickness of turnaround time.
Another interesting question they answered: did the democratic process ever fail – how did they come to a decision when there were deadlocks, floundering, and indecision? They said that in general they had deadlines to meet, management expect certain answers by certain times, and if nothing else they had the managers to fall back on when they couldn’t arrive at agreement on their own.
(As a former documentation manager, I am pleased to hear that benign despots occasionally have their moments.)

CMS/DITA Session 2: Veni, Vidi, Wiki
April 7th, 2008 by ScriptoriumTech
by David Kelly
This session had a longish subtitle about open source wikis changing the face of technical publishing. It was presented by Peter Dykstra of MetaphorX, which is his consulting company.
The central point of this presentation seemed to be that Wikis had progressed to where they could be an easy entry into the world of CMSs, collaborative authoring, and XML -based outputs.
After an introduction to Wikis in general, he talked about the current state of Wikis. He listed the PHP-based Wikis (which he described as “cool”) and the Java-based Wikis (“IT friendly”). The list of functions now present in Wikis was surprisingly large – change management, versioning, role support, database back-ends, and infrastructure friendliness were some of the features I managed to jot down. (My notes also say “spoke very quickly.”) (And again, the salient talking points should be in the conference materials.) My sense of the functionality was that the Wikis should be considered more “document management” than “content management,” although he hinted that with planning and effort, a Wiki “document” could be fairly small, and therefore the management of documents could be fairly granular. (I eagerly await a robust demonstration of this assertion.)
He mentioned that he sees companies becoming much more receptive to open source applications than formerly. Wikis are becoming more popular, he says, due to the uncontrolled sprawl of Intranets. I’ve certainly seen THAT in more than one place. Of course, he also mentioned the sprawl of older Wiki sites he had seen. Supposedly the newer CMS-type functions help to control this problem.
With the use of a Wiki, he sees the opening of audience access to content and more collaborative information development. With the use of roles-based permissions, the writer can remain in control of the final output, but still take advantage of the richer input. One idea he suggested also came through in most of the sessions I attended – the changing roles of technical documentation professionals. In this case, the writer becomes more an editor than a content creator. He mentioned other roles – moderator, producer, online editor…
Finally, he gave a demo of Daisy, which is an open source Wiki (there are proprietary Wikis also available, in addition to the many open source Wikis.). It appeared to have some basic “book-like” features for concatenating topics, and the ability to define metadata for topics, and some conditional-type controls for topic inclusion/exclusion. What I call “topics” here were actually Wiki documents that he had defined to look like DITA topics. As far as he is aware, there is no full DITA solution currently available in an open source CMS.
So for all you Wiki open source developers out there, it looks like the field is wide open. Piece o’ cake!

Session 1 from the CMS/DITA Convention – Best Practices for DITA in a Global Economy
April 7th, 2008 by ScriptoriumTech
by David Kelly
I am pleased to report that I made an error about making an error. Do those cancel out to mean I didn’t make any errors, or does it mean… well, never mind. Suffice it to say I won’t be able to blog “live” from the convention center during this convention, so I’ll be catching up in the evenings from paper notes. No hasty indiscrections here, I’m afraid!
I will try to keep it to one entry per session to make the blog more “topic oriented.”
Session 1, given by Eric Severson, CTO of Flatirons Solutions, had the promising title “Best Practices for DITA in a Global Economy.” Possibly due to the extensive (i.e., painful) experience I’ve had with DITA the past several years, I was disappointed to find that two thirds of the presentation was an introduction to DITA. This was surprising since Eric had asked for a show of hands from those who had DITA experience, and probably two thirds of the room raised their hands.
The final third of the presentation focused on re-use of information and the kinds of issues it raises for localization. He recommended a process similar to the one his company uses, which they call a “DITA Maturity Model.” This process is used to analyze legacy content to determine what kind of rewriting/restructuring is needed when converting to topic-oriented DITA source. One part of this involves a “content architecture” or information architecture phase to identify reusable information. This involves a fairly granular examination of the text to determine what content is identical, or close to identical, so it can be put into a form for optimum re-use. By all accounts I heard today, the payoff for this extensive, up-front architectural work is reduced translation costs and ease of maintenance. More about that in a later entry.
Some tips he passed along: When writing topics for reusability, think about them as having no context and therefore having to stand alone. Try to stick to the standard form of DITA if possible — “Specializations can cause problems” for localization. He also recommended not using conrefs haphazardly, but thinking from a perspective of “editorial integrity.” By this he meant avoiding conrefs to fragments of information, but making them more context-free, stand-alone chunks.
And even though I complain about the long introduction, I did later overhear someone saying she was glad he gave the introduction, otherwise she would not have understood the rest of what he was talking about.

CMS/DITA Conference Keynote Notes
April 7th, 2008 by ScriptoriumTech
by David Kelly
It appears that wireless access is available after all – a theory I have not yet tested, since I am now in the hotel room while they set up the conference rooms for the different tracks. We’ll attribute my earlier failure to connect to “user error.” I know, difficult to conceive…
JoAnn Hackos delivered the keynote speech for this, the 10th CMS conference, spending a good bit of time on the history of the development of content management, from the old days of human “code librarians” to the present. I won’t repeat her points, since I think they will be published elsewhere, but her take is that some of the current drivers for CMS implementation are:
* Minimizing the inventory of information for efficiency and ease of re-use.
* The need for collaboration rather than individual ownership of information.
* The need to capture and preserve the meaning/purpose of information by means of metadata.
* The need to secure, version, and compare versions of data.
She believes that these needs are starting to drive larger organizations toward “global enterprise content management” (GECM). She sees a movement toward wider adoption of CMSs – and standards – in the industry, a point emphasized by the 285 attendees and 17 countries represented at the conference.
Concerning the future, she did not offer any comments, but invited the audience to contribute thoughts about their needs to the conference blog (at www.cidm.com).
Having now delivered the first new-to-me acronym and a fuzzy, open-ended comment about the future, I will stroll down to the conference center again to try out the wireless connection. Take a deep breath…

Content Management Strategies 2008 Conference – Greeting
April 7th, 2008 by ScriptoriumTech
by David Kelly
Greetings from lovely uptown Santa Clara, CA, where I’m attending the CMS/DITA North America Conference. I plan to make entries as often as possible, as it appears there will be a lot to hear and talk about. One limitation will be that there appears to be no wireless network coverage in the convention center, so the entries will be more sporadic than I had planned.
The attendance is planned to be around 280 people, and there are a lot of vendors from the content authoring, CMS, and services/consulting arenas. I’ve spoken with people from Vasont, CSoft, Ovitas, and Empolis so far, and we’re still 30 minutes from the conference being kicked off.
So far the consensus seems to be that DITA and CMSs are becoming well integrated. There are still issues of migration, seelcting vendors, specializations for various industries, and choices of authoring environments, among others. I’ll be keeping my ears open for perspectives on these issues, and I’ll pass along what I learn — as network access permits.

Integrating XML and FrameMaker
December 31st, 2007 by ScriptoriumTech
| NOTE: | This white paper is based on FrameMaker 7. |
FrameMaker provides solid support for XML-based authoring workflows. Its PDF and print output capabilities are stellar. Because FrameMaker combines authoring and publishing in a single application, configuring XML support can be quite challenging.
It sounds too good to be true: store information in an application-independent, platform-independent format and then render that information through the software of your choice. XML does, in theory, deliver on this promise, but implementation is rarely as straightforward as you might hope. (more…)

Open-Source Sunrise?
May 25th, 2005 by ScriptoriumTech
Jim Rapoza at eWeek had an interesting suggestion in regard to the Adobe/Macromedia merger. He notes that the sorting and sifting that follows most software mergers leads to some products fading away. Some actually get a funeral but often there is simply a loss of interest. Rapoza suggests turning over these orphans to the open-source community.
This probably sounds like folly. Companies just don’t do this sort of thing. However, IBM has certainly demonstrated that moving technology to the open source community can be a valid business strategy.
I have seen an acquiring company become “a deer in the headlights”–as the cost of maintaining/enhancing a product is weighed against the cost of offending customers by killing it. The latter cost isn’t taken lightly since these customers probably license software from both companies. It is not unusual to see critical business applications and processes that have been built around the specialty tools that are most likely the targets of such “alignments”. The integration investment often significantly exceeds that of the underlying tools.
Just a few short years ago this wasn’t even an option. Perhaps Rapoza is onto something.

Clip, clop…RoboHelp rides off into the sunset
April 29th, 2005 by ScriptoriumTech
At the WritersUA conference last March, Macromedia cancelled participation in the trade show at the last minute. Immediately, rumors began flying (although in fairness we have to say that the Adobe/Macromedia merger was not one of the myriad conspiracy theories that emerged). Before the conference ended, a content-free Macromedia statement appeared in a RoboHelp forum at Macromedia’s site.
The apparent demise of RoboHelp matches the general industry trend over the last two years. Technical publishing groups are beginning to demand that tools support open standards (XML and XSL) that offer greater flexibility.
We are being asked to produce more and varied forms of output (beyond the basic print and online help), to share content, and to extend publishing workflows to include other departments and organizations. Software customization based upon user profiles and authorization is creating the need for user assistance that accommodates a complex matrix of variations in both feature sets and user interfaces.
These demands make a transition from proprietary solutions to open standards-based tools quite appealing. Structured authoring based on XML can address all of these requirements, and XSLT is quickly becoming a popular tool for transforming data from a broad variety of applications. This is where many of our clients are moving and we are adding classes (XSLT) and offering products (DocFrame) to meet the demand.

