
Conversion ChallengesHuman resource is the most expensive part of most XML conversion processes. The main challenges that influence the amount of human intervention required during conversion, and hence the cost, are: Complexity of source materialThree broad document categories indicate the increasing difficulty of conversion, and hence the amount of human intervention (and thus cost):
For additional information on this subject, see Document Complexity and XML Conversion Consistency of source materialContent authors and contributors typically use a variety of tools and formats (word processors, desktop publishing applications, presentation software, etc.) to create and store information in many different file formats. This can complicate the conversion process. For example, a user can construct a simple indented bulleted list item by:
For each of these methods, a different set of typographical codes will be generated. In addition, different word processors and typesetting applications will emit different codes for the same user actions. Conversion software based on rules therefore requires human programming resource to take account of every input variation, or low quality XML will result. See our paper entitled: There are no unstructured documents for more details. Type of markup requiredThe intended application of the XML output will dictate the type of markup required. The more sophisticated the markup, the more versatile it is, and the more human resource will be required in its production. Three distinct and increasingly useful levels of markup are:
For more information, see Classes of XML Output. |
More InfoRefresher
Context
Videos |