C++读取包含图片和格式的html文件中的内容

要读取包含图片和格式的.html文件中的内容，您可以使用C++中的文件流来读取文件，并使用适当的库来解析HTML内容和提取包含的图片和格式信息。

以下是一个示例代码，使用C++标准库和htmlcxx库来读取和解析HTML文件。

#include <iostream>
#include <fstream>
#include <string>
#include <htmlcxx/html/ParserDom.h>

int main() {
    // 读取HTML文件
    std::ifstream file("example.html");
    std::string content((std::istreambuf_iterator<char>(file)),
                         std::istreambuf_iterator<char>());

    // 创建HTML解析器
    htmlcxx::HTML::ParserDom parser;
    tree<htmlcxx::HTML::Node> dom = parser.parseTree(content);

    // 遍历DOM树，提取图片和格式信息
    for (tree<htmlcxx::HTML::Node>::iterator it = dom.begin(); it != dom.end(); ++it) {
        if (it->tagName() == "img") {
            // 提取图片URL
            std::string imgUrl = "";
            tree<htmlcxx::HTML::Node>::iterator imgIt = it->begin();
            for (; imgIt != it->end(); ++imgIt) {
                if (imgIt->tagName() == "src") {
                    imgUrl = imgIt->text();
                    break;
                }
            }
            std::cout << "Image URL: " << imgUrl << std::endl;
        }
        else if (it->tagName() == "style" || it->tagName() == "link") {
            // 提取格式信息
            std::string format = "";
            if (it->tagName() == "style") {
                format = it->text();
            }
            else if (it->tagName() == "link") {
                tree<htmlcxx::HTML::Node>::iterator attrIt = it->begin();
                for (; attrIt != it->end(); ++attrIt) {
                    if (attrIt->tagName() == "href") {
                        format = attrIt->text();
                        break;
                    }
                }
            }
            std::cout << "Format: " << format << std::endl;
        }
    }

    return 0;
}

在上述示例中，example.html是包含图片和格式的HTML文件的路径。代码读取文件内容并使用HTML::ParserDom解析HTML内容。然后，它遍历DOM树并查找img标签和style或link标签来提取图片和格式信息。最后，它打印出提取的图片URL和格式信息。

请注意，此示例使用了htmlcxx库（https://github.com/lexborisov/htmlcxx），因此您需要在项目中包含该库的头文件和链接该库。您可以根据您的项目需求进行适当的调整