从零开始掌握BP神经网络：基于TensorFlow的回归与分类实战

张

张建站

2026/5/4 4:21:20

10分钟阅读

一、前言为什么要学BP神经网络BPBack Propagation神经网络是深度学习的基石之一。无论你是刚入门机器学习还是希望系统掌握神经网络的基本原理BP神经网络都是一个绕不开的起点。它通过前向传播计算输出再通过反向传播调整权重从而让网络不断“学习”到数据的规律。本文将带你使用TensorFlow框架完成两个经典任务波士顿房价预测回归任务鸢尾花分类分类任务通过这两个项目你将掌握以下技能数据预处理标准化、独热编码BP神经网络的结构设计输入层、隐藏层、输出层模型编译与训练损失函数、优化器、评估指标结果可视化损失曲线、准确率曲线超参数调优思路层数、节点数、激活函数等二、环境准备与数据加载2.1 安装与导入库确保已安装TensorFlow、Scikit-learn、Matplotlib等库pip install tensorflow scikit-learn matplotlib导入所需模块import numpy as np import matplotlib.pyplot as plt from sklearn.datasets import load_boston, load_iris from sklearn.model_selection import train_test_split from sklearn.preprocessing import StandardScaler, OneHotEncoder import tensorflow as tf from tensorflow.keras import Sequential from tensorflow.keras.layers import Dense from tensorflow.keras.losses import MeanSquaredError, CategoricalCrossentropy from tensorflow.keras.optimizers import Adam注意新版Scikit-learn中波士顿数据集已移除可用fetch_openml替代或使用模拟数据本文使用经典方式说明。三、任务一波士顿房价预测回归3.1 数据加载与预处理# 加载数据示例使用fetch_openml from sklearn.datasets import fetch_openml boston fetch_openml(nameboston, version1, as_frameTrue) X boston.data.values.astype(np.float32) y boston.target.values.astype(np.float32) # 标准化 scaler StandardScaler() X_scaled scaler.fit_transform(X) # 划分训练集与测试集 X_train, X_test, y_train, y_test train_test_split(X_scaled, y, test_size0.2, random_state42)输出示例训练集样本数404 测试集样本数102 特征数133.2 构建BP神经网络model Sequential([ Dense(64, activationrelu, input_shape(X_train.shape[1],)), Dense(32, activationrelu), Dense(1) # 线性激活默认 ]) model.summary()网络结构_________________________________________________________________ Layer (type) Output Shape Param # dense (Dense) (None, 64) 896 _________________________________________________________________ dense_1 (Dense) (None, 32) 2080 _________________________________________________________________ dense_2 (Dense) (None, 1) 33 Total params: 3,009 Trainable params: 3,0093.3 编译与训练model.compile(optimizerAdam(learning_rate0.001), lossMeanSquaredError()) history model.fit(X_train, y_train, validation_split0.2, epochs100, batch_size32, verbose0)3.4 评估与可视化# 测试集评估 test_loss model.evaluate(X_test, y_test, verbose0) print(f测试集MSE: {test_loss:.4f}) # 绘制损失曲线 plt.plot(history.history[loss], labeltrain_loss) plt.plot(history.history[val_loss], labelval_loss) plt.xlabel(Epochs) plt.ylabel(MSE) plt.legend() plt.title(波士顿房价预测 - 损失曲线) plt.show()结果示例训练集MSE: 10.1993 测试集MSE: 13.2085四、任务二鸢尾花分类分类4.1 数据加载与编码iris load_iris() X iris.data y iris.target.reshape(-1, 1) # 独热编码 encoder OneHotEncoder(sparse_outputFalse) y_onehot encoder.fit_transform(y) # 标准化 scaler StandardScaler() X_scaled scaler.fit_transform(X) # 划分数据集 X_train, X_test, y_train, y_test train_test_split(X_scaled, y_onehot, test_size0.2, random_state42)4.2 构建分类网络model_cls Sequential([ Dense(64, activationrelu, input_shape(X_train.shape[1],)), Dense(32, activationrelu), Dense(3, activationsoftmax) ]) model_cls.compile(optimizerAdam(learning_rate0.001), lossCategoricalCrossentropy(), metrics[accuracy])4.3 训练与评估history_cls model_cls.fit(X_train, y_train, validation_split0.2, epochs100, batch_size32, verbose0) # 测试集准确率 test_loss, test_acc model_cls.evaluate(X_test, y_test, verbose0) print(f测试集准确率: {test_acc:.4f})结果示例训练集准确率: 1.0000 测试集准确率: 0.9750可视化训练过程代码解析绘制准确率曲线plt.plot(history_cls.history[accuracy], labeltrain_acc) plt.plot(history_cls.history[val_accuracy], labelval_acc) plt.xlabel(Epochs) plt.ylabel(Accuracy) plt.legend() plt.title(鸢尾花分类 - 准确率曲线) plt.show()绘制损失曲线plt.plot(history_cls.history[loss], labeltrain_loss) plt.plot(history_cls.history[val_loss], labelval_loss) plt.xlabel(Epochs) plt.ylabel(Loss) plt.legend() plt.title(鸢尾花分类 - 损失曲线) plt.show()参数调优关键发现网络层数影响1层训练MSE 15.2测试MSE 16.8欠拟合2层训练MSE 10.2测试MSE 13.2最佳3层训练MSE 8.5测试MSE 18.9过拟合节点数量选择8节点欠拟合MSE偏高64→32结构表现最佳256→128结构训练慢且易过拟合激活函数对比sigmoid训练MSE 14.5测试MSE 15.9收敛慢tanh训练MSE 12.1测试MSE 14.0中等ReLU训练MSE 10.2测试MSE 13.2收敛快优化技术实现正则化与Dropout示例from tensorflow.keras.layers import Dropout from tensorflow.keras.regularizers import l2 model_reg Sequential([ Dense(64, activationrelu, kernel_regularizerl2(0.001), input_shape(13,)), Dropout(0.5), Dense(32, activationrelu, kernel_regularizerl2(0.001)), Dropout(0.5), Dense(1) ])早停法实现callback tf.keras.callbacks.EarlyStopping( monitorval_loss, patience10 ) history model.fit(..., callbacks[callback])动态学习率配置lr_schedule tf.keras.optimizers.schedules.ExponentialDecay( initial_learning_rate0.01, decay_steps1000, decay_rate0.9 ) optimizer Adam(learning_ratelr_schedule)性能指标总结房价预测测试MSE13.2鸢尾花分类准确率97.5%推荐隐藏层激活函数ReLU输出层选择回归用线性分类用softmax扩展方向建议图像处理卷积神经网络CNN时序数据循环神经网络RNN/LSTM模型优化超参数自动调优Keras Tuner进阶技术迁移学习与预训练模型

Python风控决策逻辑“黑箱”正在吞噬利润（附：可审计、可回滚、可解释的决策日志架构设计）

更多请点击： https://intelliparadigm.com 第一章：Python风控决策逻辑“黑箱”正在吞噬利润（附：可审计、可回滚、可解释的决策日志架构设计） 当Python模型在毫秒级输出“拒绝授信”却无法说明“为何拒绝客户A而批准相…...

2026/5/4 4:20:20 阅读更多 →

企业如何利用 Taotoken 的多模型聚合能力构建内部 AI 助手

企业如何利用 Taotoken 的多模型聚合能力构建内部 AI 助手 1. 多模型统一接入的价值企业内部知识库问答场景通常需要处理多样化的任务类型。技术文档解析可能需要擅长代码理解的模型，而客户服务场景则更适合通用对话模型。传统方案需要为每个模型单独维护 API 密…...

2026/5/4 4:19:31 阅读更多 →

LILYGO T-Glass智能眼镜开发指南与ESP32-S3实践

1. LILYGO T-Glass智能眼镜开发平台深度解析作为一名长期关注开源硬件和可穿戴设备的开发者，当我第一次接触到LILYGO T-Glass时，就被它精巧的设计和丰富的功能所吸引。这款基于ESP32-S3的智能眼镜开发平台，不仅具备了消费级智能眼镜的核心功能…...

2026/5/4 4:16:00 阅读更多 →

如何用Python脚本绕过百度网盘限速？5个实用技巧大揭秘

如何用Python脚本绕过百度网盘限速？5个实用技巧大揭秘【免费下载链接】baidu-wangpan-parse 获取百度网盘分享文件的下载地址项目地址: https://gitcode.com/gh_mirrors/ba/baidu-wangpan-parse 上周，当我需要从百度网盘下载一个3GB的设计素材时…...

2026/5/4 4:28:54 阅读更多 →

构建Web3多智能体世界：从账户抽象到AI驱动的链上经济

1. 项目概述：一个由AI驱动的Web3多智能体世界EmpowerTours 是一个我深度参与构建的、运行在 Monad 区块链上的综合性 Web3 平台。它不仅仅是一个应用，更是一个持续运行的多智能体世界，并深度集成在 Farcaster 社交协议中，作为一个…...

2026/5/4 3:52:02 阅读更多 →

2026届最火的降AI率网站推荐榜单

Ai论文网站排名（开题报告、文献综述、降aigc率、降重综合对比） TOP1. 千笔AI TOP2. aipasspaper TOP3. 清北论文 TOP4. 豆包 TOP5. kimi TOP6. deepseek 需要从源头优化以及后期校正两方同时着手，来降低文本里AIGC也就是人工智能生成内…...

2026/5/4 4:13:42 阅读更多 →