HTML是一种描述富文本的可能性。作为WWW的文件格式,它是完善的。恕我直言,可能任何现代的富文本文本处理工具都支持它。 (我个人知道WinWord –多年。)
编写HTML文件相当容易,因为HTML文件实际上不是源代码,它可能是用纯ASCII编写的。
简短演示print-HTML.c
:
#include <stdio.h>
struct Entry {
const char *author;
const char *title;
};
void printEntry(FILE *f, struct Entry *pEntry, int i)
{
fprintf(f,
"<tr><!-- start of table row -->\n"
"<td>%d</td><!-- number -->\n"
"<td>%s</td><!-- Author -->\n"
"<td>%s</td><!-- Title -->\n"
"</tr><!-- end of table row -->\n",
i, pEntry->author, pEntry->title);
}
void printTable(FILE *f, size_t nEntries, struct Entry table[])
{
fprintf(f,
"<table><!-- start of table -->\n"
"<tr><!-- start of table head row -->\n"
"<th>No.</th><th>Author</th><th>Title</th>\n"
"</tr><!-- end of table head row -->\n");
for (size_t i = 0; i < nEntries; ++i) {
printEntry(f, table + i, (int)i + 1);
}
fprintf(f,
"</table><!-- end of table -->\n");
}
void printDoc(
FILE *f, const char *title, size_t nEntries, struct Entry table[])
{
fprintf(f,
"<!DOCTYPE html>\n"
"<html>\n"
"<head>\n"
"<title>%s</title>\n"
"</head>\n"
"<body>\n"
"<h1>%s</h1>\n",
title, title);
printTable(f, nEntries, table);
fprintf(f,
"</body>\n"
"</html>\n");
}
int main()
{
/* the sample table */
struct Entry table[] = {
{ "Kernighan and Ritchie", "The C Programming Language" },
{ "Kernighan and Ritchie", "Programming in C" },
{ "Tim Berners-Lee", "Weaving the Web" },
{ "Tim Berners-Lee", "Hypertext Markup Language: the HTML explained from the Inventor of the WWW" }
};
enum { nEntries = sizeof table/sizeof table[0] };
/* output as HTML */
printDoc(stdout, "My Favorite Books", nEntries, table);
/* done */
return 0;
}
样品会话:
火狐:
$ gcc -std=c11 -o print-HTML print-HTML.c
$ ./print-HTML
<!DOCTYPE html>
<html>
<head>
<title>My Favorite Books</title>
</head>
<body>
<h1>My Favorite Books</h1>
<table><!-- start of table -->
<tr><!-- start of table head row -->
<th>No.</th><th>Author</th><th>Title</th>
</tr><!-- end of table head row -->
<tr><!-- start of table row -->
<td>1</td><!-- number -->
<td>Kernighan and Ritchie</td><!-- Author -->
<td>The C Programming Language</td><!-- Title -->
</tr><!-- end of table row -->
<tr><!-- start of table row -->
<td>2</td><!-- number -->
<td>Kernighan and Ritchie</td><!-- Author -->
<td>Programming in C</td><!-- Title -->
</tr><!-- end of table row -->
<tr><!-- start of table row -->
<td>3</td><!-- number -->
<td>Tim Berners-Lee</td><!-- Author -->
<td>Weaving the Web</td><!-- Title -->
</tr><!-- end of table row -->
<tr><!-- start of table row -->
<td>4</td><!-- number -->
<td>Tim Berners-Lee</td><!-- Author -->
<td>Hypertext Markup Language: the HTML explained from the Inventor of the WWW</td><!-- Title -->
</tr><!-- end of table row -->
</table><!-- end of table -->
</body>
</html>
$ ./print-HTML >test.html
$
下面,应用程序的一些快照我打开test.html
MS Word for Windows中:
MS Excel中:
更新:
在上面的示例代码,我仔细防止使用元字符(<
,>
,&
和"
)。如果这些字符出现在原始文本中,则它们可能不会按原样打印(因为这些字符在HTML语法中可能有特殊含义)。相反,他们必须通过自己的实体取代:
<
⇒ <
(开始标记)
>
⇒ >
(标记结束)
&
⇒ &
(开始实体)
"
⇒ "
(引用属性值的开始/结束)
'
⇒ '
(引用属性值的替代开始/结束)。
在HTML中,还有很多预定义的实体。 (在XML中,这些是唯一预先定义的实体。)
更新示例代码:
#include <stdio.h>
void printHTMLText(FILE *f, const char *text)
{
for (; *text; ++text) {
switch (*text) {
case '<': fprintf(f, "<"); break;
case '>': fprintf(f, ">"); break;
case '&': fprintf(f, "&"); break;
case '"': fprintf(f, """); break;
case '\'': fprintf(f, "'"); break;
default: putc(*text, f);
}
}
}
struct Entry {
const char *author;
const char *title;
};
void printEntry(FILE *f, struct Entry *pEntry, int i)
{
fprintf(f,
"<tr><!-- start of table row -->\n"
"<td>%d</td><!-- number -->\n"
"<td>",
i);
printHTMLText(f, pEntry->author);
fprintf(f,
"</td><!-- Author -->\n"
"<td>");
printHTMLText(f, pEntry->title);
fprintf(f,
"</td><!-- Title -->\n"
"</tr><!-- end of table row -->\n");
}
void printTable(FILE *f, size_t nEntries, struct Entry table[])
{
fprintf(f,
"<table><!-- start of table -->\n"
"<tr><!-- start of table head row -->\n"
"<th>No.</th><th>Author</th><th>Title</th>\n"
"</tr><!-- end of table head row -->\n");
for (size_t i = 0; i < nEntries; ++i) {
printEntry(f, table + i, (int)i + 1);
}
fprintf(f,
"</table><!-- end of table -->\n");
}
void printDoc(
FILE *f, const char *title, size_t nEntries, struct Entry table[])
{
fprintf(f,
"<!DOCTYPE html>\n"
"<html>\n"
"<head>\n"
"<title>");
printHTMLText(f, title);
fprintf(f,
"</title>\n"
"</head>\n"
"<body>\n"
"<h1>");
printHTMLText(f, title);
fprintf(f,
"</h1>\n");
printTable(f, nEntries, table);
fprintf(f,
"</body>\n"
"</html>\n");
}
int main()
{
struct Entry table[] = {
{ "Kernighan & Ritchie", "The C Programming Language" },
{ "Kernighan & Ritchie", "Programming in C" },
{ "Tim Berners-Lee", "Weaving the Web" },
{ "Tim Berners-Lee", "Hypertext Markup Language: the HTML explained from the Inventor of the WWW" }
};
enum { nEntries = sizeof table/sizeof table[0] };
printDoc(stdout, "My Favorite Books", nEntries, table);
return 0;
}
将打印例如
{ "Kernighan & Ritchie", "The C Programming Language" }
为:
<td>Kernighan & Ritchie</td><!-- Author -->
<td>The C Programming Language</td><!-- Title -->
注:
"
实际上是在只有双引号中的属性值替换。 (以及单引号属性值中的'
)。反过来,<
和>
不需要在属性值中被替换。为了使事情简单而紧凑,功能printHTMLText()
取代了这些字符中的任何一个。
from * this * function?好吧,根本不是。但是,当然你可以编写文件,掌握OOXML规范并做相应的输出(提示:这是一个**大量的工作,你的“功能”可能会增加到几个模块) –
什么是' a',为什么你期望每个数据元素都具有相同的确切大小(包括终止零)?这与编写一个Word文件有什么关系? –
@Felix Palmen你会给我一些链接要遵循吗? – moh89