为了正常的体验网站,请在浏览器设置里面开启Javascript功能!

用BP神经网络算法和改进的BP神经网络算法研究预测电渗析过程的分离百分比-外文翻译

2017-09-28 27页 doc 86KB 12阅读

用户头像

is_954223

暂无简介

举报
用BP神经网络算法和改进的BP神经网络算法研究预测电渗析过程的分离百分比-外文翻译用BP神经网络算法和改进的BP神经网络算法研究预测电渗析过程的分离百分比-外文翻译 用BP神经网络算法和改进的BP神经网络算法研究预测 电渗析过程的分离百分比 摘要 在电渗析过程中,分离百分比(SP)与一些影响因素(进料浓度(C)、稀室的流通率(Q)、反应温度(T)和应用电压(V))有非线性关系,并且这些关系很难用一个简单的公式来表示。四个影响因素对SP产生了显著影响。在这篇论文中,对四个因素进行了电渗析实验研究。反向传播(BP)神经网络算法和改进BP算法被用于SP的预测,并且他们的预测能力可以反映出他们在非线性关系的复...
用BP神经网络算法和改进的BP神经网络算法研究预测电渗析过程的分离百分比-外文翻译
用BP神经网络算法和改进的BP神经网络算法研究预测电渗析过程的分离百分比-外文翻译 用BP神经网络算法和改进的BP神经网络算法研究预测 电渗析过程的分离百分比 摘要 在电渗析过程中,分离百分比(SP)与一些影响因素(进料浓度(C)、稀室的流通率(Q)、反应温度(T)和应用电压(V))有非线性关系,并且这些关系很难用一个简单的公式来示。四个影响因素对SP产生了显著影响。在这篇论文中,对四个因素进行了电渗析实验研究。反向传播(BP)神经网络算法和改进BP算法被用于SP的预测,并且他们的预测能力可以反映出他们在非线性关系的复杂数据上的适应能力。使用不同的神经网络结构、神经元传递函数和学习率,可以获得最佳训练参数。比较BP神经网络算法和改进的BP神经网络算法,由于改进BP神经网络算法适当地改变了正确学习率和权值的比率,所以它比BP神经网络算法更好。在高温和高电压的情况下,改进BP神经网络算法将有更好的预测性能,这是因为改进BP神经网络算法有大数值的泛化能力。 关键字:BP神经网络算法;改进BP算法;电渗析法分离百分比;改进BP算法;自适应学习算法 1.导言 电渗析(ED)是在电势差驱动力的帮助下,离子从溶液的一个带电膜分离到另一个电膜的过程。这一过程广泛用于生产饮用水及从半咸水和海水中处理水、处理工业废水、从污水和盐生产恢复有用的物料。文献[1-6]中对电渗析的应用和基本原则进行了检验。因为其化学稳定性高、灵活性和由于其强大的离子特征带来的高离子电导率,人们开发了许多使用离子交换膜的电渗析应用并且商业化[7-10]。两种不同类型的离子交换膜用于常规电渗析:阳离子交换(CEM)和阴离子交膜换(AEM),分别渗透阳离子和阴离子[11]。 然而,在性能经营过程中,电流密度应保持小于极限电流密度,因为水的离解引起的结垢和膜破损[12]。所以测定的极限电流密度和系统的潜力也被执行。极限电流密度是最大电流密度(当前每单位面积膜),它可以使用而且不会导致高的电阻和较低的电流功率等负面影响。在极限电流密度下,在阳离子交换膜或阴离子交换膜表面的阳离子或阴离子的浓度,恰到好处的,在稀室内消耗的溶液 +?是零[12-14]。超越极限电流密度后,水分解时生成的H和OH运输一部分的电流[15]。 人工神经网络(人工神经网络)利用相互联系的数学节点或神经元以形成一个网络,可以模拟复杂的功能关系[16]。它的发展始于20世纪40年代以帮助认知科学家理解中枢神经系统的复杂性。它已经稳步发展,并适应科学的许多领域。基本上,人工神经网络是源自人类的大脑在学习的过程中的数值结构灵感。他们构造作为替代的数学工具用于解决不同领域问题的系统辨识、预测、模式识别、分类、过程控制及其他许多 [17]。人工神经网络已广泛的在膜过程中应用(反渗透、纳滤、超滤、微滤膜、膜过滤、气体分离、膜生物反应器和燃料电池) [18]。然而,在文献中很少有几条应用人工神经网络预测电渗析过程中SP(分离百分比)的记录。 人工神经网络中受到最多关注的是反向传播网络(BPN)[19]。BPNs有分层的前馈网络框架。在经典BPNs结构中,产出的每个图层被直接送到每个神经元的下一层。BPNs有许多层,但人们认为至少三个图层:一个接收和分发输入的输入层、捕捉输入和输出的非线性关系的中间层或隐藏层,和一个生产计算数据的输出层。BPNs神经元也可能包含产生恒定的输出但没有收到输入这样的偏差 1 [20-21]。BP算法基于神经网络错误最小化。这些错误被描述为期望的产出与实际之间的差异 [22]。当训练精度满足时即训练完成(图1)。 BPNs操作具有更好的推广和容错功能,然而,它也有一些不足之处: (1)慢收敛性可能导致较长的训练时间。 (2)在训练过程中出现可能局部极值点。 所以在这项研究中,改进的BP算法被用来改进电渗析过程中分离百分比的预测能力。在这篇论文中,自适应学习算法和灵活的BP算法是改进的BP算法在电渗析过程中的应用。 2.材料和 2.1.实验仪器和材料 电渗析过程中,在表1中显示了实验仪器和材料。 此外,其他实验材料:量筒、烧杯、去离子水等。这些实验的目的是研究进料浓度、温度、电压和流通率对电渗析电池性能的影响。 2.2. 电渗析池和电渗析膜 有机玻璃(PMMA)做成的板和框架的电渗析池被用来进行电渗析实验(图2)。电渗析池由三部分组成,一对CEM(阳离子交换膜)和AEM(阴离子交换膜)和一对电极。电渗析池的长度、宽度和高度分别为0.191m、0.021m和0.181m。阳 2离子交换膜和阴离子交换膜有效区域都是0.11×0.09m。这两个电极由纯铂金制 2 2成,每个电极的表面积是0.115×0.09m。稀释容量和浓缩容量分别是 330.12×0.1×0.003m和0.12×0.1×0.006m。 NaCl(氯化钠)溶液被输入到电渗析池的三个单元格中。阳离子交换膜(CEM)和阴离子交换膜(AEM)分别有阳离子渗透性和阴离子渗透性。两层膜被并排沉浸在溶液中,并有电流通过溶液。正离子迁移到阴极,阴离子迁移到阳极。注入的溶液被划分成两个流。一个是稀释水,另一个是浓缩水。在一定的流速下,可以操作电渗析池去除电离子。 离子在电渗析池中被电解,阴极和阳极发生的反应如下所示: 阳极反应: ?2Cl?2e?Cl? 2 +HO?2e?1/2O?+2H 22 阴极反应: ?2HO+2e?H?+2OH。 22 在阳极反应中,生成了Cl和O。此外,在阴极反应中生成了H。这些气体222可能增加电渗析池的电阻,因此两个小洞被用来释放电渗析池板上的气体。集中的气流被处理掉,以防止这些气体积聚。电解膜的物理和化学特性列表(表2)。 2.3.电渗析设备安装程序 电渗析设备安装包括注入溶液的一个水槽(TK-01)、直流电源供应器和控制 01, GB-02)(图3)。电渗析设备的总高度注入溶液的流动速率的两个阀门(GB- 是0.5m。两个集中的浓缩流和稀释流没有循环利用,并为预测和分析电导率收集稀释流。 2.4.实验原理 电渗析过程是膜分离技术之一。根据直接电场电渗析技术利用选择性的离子交换膜将电解质从溶液中分开的能力,以实现稀释、浓缩或纯化溶液的目的(图4)。 2.5. 极限电流密度的测定 极限电流密度(LCD)在电渗析过程中是一个重要参数,用于确定电阻和目前的利用率。通常情况下, 极限电流密度取决于膜、溶液属性、电渗析栈结构以及 3 各种操作参数,如稀释流的速度[23]。极限电流的测量方法是伏安法、pH值法等等。本文用伏安方法来确定极限电流密度,具体步骤如下。 在恒定温度、浓度和流量的情况下,调整电压表按钮并记录每一组的电压和电流。当电压很低时,电流与电压有线性关系。随着电压的增加,电流发生微妙的变化。极限电流是转折点, 我们就得到了极限电流密度。例如,T=35?C,C=0.5g/L,Q=0.5mL/s,随着电压逐渐增加,电流与电压有线性关系。当电流达到0.51A时,电压对电流有轻微影响,所以极限电流即拐点是0.51A(图5)。膜 22的有效面积是两个0.11×0.09m,所以极限电流密度是51.515A/m. 在实验中,工作电流不能超过极限电流。否则电渗析过程中将发生极化,分离百分比的预测也就毫无意义。使用伏安法,在所有的实验条件下获得的极限电流密度(表3)。 为了防止发生电渗析池极化,所有电流被控制限制电流下。在电渗析实验中, 2极限电流最大值是0.80 A,离子交换膜的有效面积0.11×0.09m,所以极限电 2流密度的最大值是80.808A/m. 2.6.实验数据 表4)。对四个因素进行研究:进料浓度(0.5 g / 实验在极限电流密度下开展( L,1 g / L 和1.5 g / L),稀释舱的流量(0.05毫升/秒,0.25毫升/秒, 0.5毫升/秒,0.75毫升/秒和1毫升/ s),反应温度(20?C,25?C, 30?C,35?C,40?C 和8 V)。 和45?C),外加电压(2 V,5 V 3.基于BP神经网络算法和改进的BP算法 3.1.BP神经网络 典型BP神经网络是一个全神经网络包括一个输入层、隐藏层和输出层[24,25], 训练过程的目标是调整权值。网络训练是一个无约束非线性最小化问题[26]。研究人员声称,一个隐层的网络所需的任何精度逼近任意连续函数[27-29]。 BP神经网络由向前反馈和误差反向传播两部分组成。向前传播,输入从输入层开始传播,被一些隐藏层处理后到达输出层,输出层的输出预测值与实际输出比较,它们之间的区别是聚合生成的误差。在误差反向传播中,当误差超出误差范围, 误差被调整回传播权值。学习过程一直持续到误差聚合目标值 (图6)。 4 3.2.BP神经网络的构建 有关于BP神经网络的一些预防: (1) 预处理样品。通常样品没有直接使用网络训练,而是从原始数据预处理。实验数据包含一些不确定因素训练。预处理方法是提高训练和测试数据可靠性的必要准备。 (2)优化初始权值。网络的初始权值影响最终的训练结果,影响网络是否能达到可接受的精度。 (3)选择隐藏的图层数和神经元的隐藏图层。隐藏图层和神经元的隐藏图层的选择,是直接影响复杂问题的映射能力的最关键的一步。现在可靠的算法从隐藏层和大量的神经元开始,训练和测试,然后增加它们的数量。比较不同训练和测试样本的结果,选择更合适的隐藏层个数及其神经元。在这项研究中,具有不同神经元隐层的三层神经网络用于这次电渗析实验。 (4)选择训练样本。网络所需的样本取决于复杂程度的映射关系。一般情况下,映射关系越复杂,需要越多训练样本。从所有数据中选择样本时,BPNs需要遵守以下原则:足够数量的、典型的和均匀的。 3.3.改进的BP算法 BPNs基于坚实的理论和严格的推导,然而BPNs包括慢收敛、偶尔出现的局部极值点,所以训练过程中有许多不足之处。在实际应用中,BP算法很难是见效,因此人们提出了一些改进的BP算法,以提高预测能力。有几种方法改进的BP算法,如附加动量方法,自适应学习率方法,灵活的BP算法,等等。在此论文中,自适应学习率方法和灵活的BP算法被用来预测在电渗析过程中的分离百分比并用于与BPNs比较预测能力。 3.3.1.自适应学习算法 BP算法的训练过程受不当学习速率的影响有慢收敛的缺点。BP算法中的权值调整取决于学习速率和斜率。在BP算法中,学习速率是恒定的。事实上,当学习速率较低、训练时间变长,收敛变得较慢。当学习率太高,出现了振荡和分歧,这会造成系统不稳定。自适应学习率如图7中所示。 自适应学习速率的基本原理:当学习速率(η)增加,将造成学习时间缩短;学习速率越高,越难收敛。在这种情况,学习速率应该会减少直到训练过程的收 5 敛。可以通过改变误差和斜率,也可以根据误差函数通过改变学习速率梯度调整学习速率。此外,通过探性地进行调整试可能改变总误差,规则如下所示: (1)如果的总误差(E)减少,学习速率需要增加。 (2)如果的总误差(E)增加,学习速率需要减少。 当新误差与原来误差的比值超过某一个值时(例如1.04),学习速率迅速下降。 3.3.2.灵活的BP算法 一般来说,用sigmoid函数来将数据从输入层传递到隐藏层。此外,sigmoid函数保持无限输入和有限界内输出。当输入大变量时,sigmoid函数的斜率将接近0。即使梯度发生了微妙的变化,这可能引起权值发生巨大的变化,所以权值逐渐倾斜最佳值,甚至使网络权值在修改过程中停止。 当训练过程振动时,数量可变的权值将会减少。在几个迭代中权值变化方向保持不变时,可变数量的权值将会增加。因此,灵活的BP算法在收敛速度上有超越其他改进BP算法的优势。 4.结果和讨论 BP神经网络和改进的BP算法的设计中,需要确定四个重要方面: 1)神经元传递函数的选择 2)隐藏图层节点的选择 3)学习率增加比率的选择 4)BP神经网络和改进的BP算法的泛化测试 4.1神经元传递函数的选择 传递函数用于彼此相邻的两层中的神经元之间的传递。而且这些传递函数必须是可诱导的。BP神经网络有一个或多个隐藏层,隐藏层神经元使用Sigmoid传递函数。输出层神经元使用纯线性传递函数且输出可以是任意值。如果输出层神经元的传递函数是Sigmoid传递函数,则整个网络输出会被限制在(?1,1)内。 Sigmoid传递函数包括log-sigmoid和tan-sigmoid两个函数。对于这两个函数,输入分别映射到(0,1)和(?1,+ 1)。log-sigmoid函数是一个单向传递函数。Tansigmoid函数是一个双向传递函数,并被称为双曲正切函数 (图8)。 6 在实际应用中,根据输入输出的关系使用某一个传递函数。如果输入不含负值,所以采纳log-sigmoid函数。如果包括负值,则采纳tan-sigmoid函数。在本文中,隐藏层神经元使用Sigmoid传递函数,输出层神经元用纯线性传递函数。 4.2 隐藏图层节点的选择 许多学者曾从事研究隐藏层的最佳节点。柯尔莫哥洛夫定理证明,只要一个隐层的节点足够多,神经网络的隐藏层可以以任意精度逼近非线性函数。然而,对于一个从输入到输出有限的映射,无限多的隐藏层节点是没有必要的。以及如何选择隐藏层节点仍是一个尚未解决的问题。隐藏层节点是通过经验和实验设计确定的。一般地,基于对输入和输出关系准确的反映,选择一个隐藏层的小节点,以保持网络的结构简单。但规模越小的节点,神经网络的泛化能力越糟糕。在相反,如果隐藏层的节点越大,训练过程中的复杂度也会随之升高,那么这种情况将导致过度拟合现象。在设计过程中,许多因素必须结合起来。在具体设计中, 首先选择一个隐藏层。如果增加隐含层节点无法获得更好的网络,层号和隐藏层节点仍需要添加。 在本文中,四个影响因素(电压、浓度、温度、流量)是一个输入层节点,并且分离百分比是一个输出层节点。所以一个隐藏层的节点数应在4和12之间。网络的结构是4:4:1, 4:10:1,4:12:1。通过MATLAB软件训练,性能图表如图9所以。 3? 在图9(A)中,总训练时间是74,最佳验证性能高于10,训练数据的MSE(均 3?方差)低于10并且MSE的测试样品和有效数据约0.005。在图9(B)中,总训练时 33??间是33,最佳验证性能高于10,训练数据的MSE低于10并且MSE的测试样品 3?和有效数据可能略高于10。在图9(C)中,总训练时间是101, 最佳验证性能高 33??于10,训练数据的MSE、测试样品和有效数据均高于10。这三个图的结果表明,4:10:1是最好的网络结构,因为它有最低的MSE值、最短的训练时间。 5.结论 BP神经网络和改进的BP算法作为预测氯化钠溶液电渗析实验分离百分比两种方法,改进的BP算法比BP神经网络优越。改进的BP算法弥补了BP神经网络不合适的学习速率和权重训练过程中的缺陷,并且改进的BP算法使用增加学习 7 速率和权值的方法。灵活的BP算法是一种改进的BP算法的方法,其预测显然比BP神经网络更好。 在不同的训练参数 (神经元的传递函数, 大量的隐藏层的神经元和学习速率) 这个条件下讨论、研究BP神经网络和改进的BP算法的预测能力。我们获得了最优训练参数。本文的隐藏层神经元使用Sigmoid传递函数,并且输出层神经元使用纯线性传递函数。4:10:1网络是最好的网络结构,因此最优的隐藏层节点是10。1.05在学习速度训练数据中作为最佳的增加比率。然而, 由于实验器材的定位和极化,导致最优训练参数值受限。 浓度、流量、温度和电压与分离比例呈现非线性关系,温度和电压与分离比例呈现正相关关系, 浓度和流率与分离比例呈现负相关关系。对于非线性关系,改进的BP算法能够更好的预测。实验结果表明,改进的BP算法对复杂的数据组有泛化的、高效的和自适应的能力,使得为复杂系统建模成为有吸引力的选择,比如水处理过程和膜技术。 8 Studies on prediction of separation percent in electrodialysis process via BP neural networks and improved BP algorithms abstract In the electrodialysis process, separation percent (SP) had nonlinear relationships with a number of influencing factors (feed concentration (C),flow rate of dilute compartment (Q), reaction temperature (T) and applied voltage (V)), and the relationships were hard to express by a simple formula. And four influencing factors had remarkable effects on SP. In this paper, the four factors were studied in the electrodialysis experiments. Back propagation (BP) neural networks and improved BP algorithms were applied on the prediction of SP, and their prediction capabilities could reflect generalization and adaptive abilities on complex data which had nonlinear relationships with each other. And with different structures of neural networks, transfer functions of neurons and learning rates, the optimum training parameters were obtained. Comparing BP neural networks with improved BP algorithms, improved BP algorithms were better than BP algorithm, due to changing with increasing ratios of learning rates and weights properly. And in the condition of high temperatures and voltages, the improved BP algorithms were predicted to have better performance, this was because improved BP algorithms had the generalization ability for high values. Keywords: BP neural networks; Improved BP algorithms; Electrodialysis Separation percent; Flexible BP algorithm; Adaptive learning rate method I 1. Introduction Electrodialysis (ED) is an electro-membrane process for separation of ions across charged membranes from one solution to another with the aid of an electrical potential difference used as a driving force. This process has been widely used for production of drinking and processed water from brackish water and sea water, treatment of industrial effluents, recovery of useful materials from effluents and salt production. The basic principles and applications of ED were reviewed in the literatures[1–6]. Numerous versatile industrial applications of ED using ion-exchange membranes were developed and commercialized because of their high chemical stability, flexibility and high ionic conductivity due to their strong ionic characteristics[7–10]. Two different types of ion-exchange membranes are used in conventional electrodialysis: cation-exchange (CEM) and anion-exchange (AEM) membranes, which are permeable to cationic and anionic species, respectively[11]. However, in operating an electrodialyzer, the current density should be maintained less than the limiting current density because water dissociation gives rise to scale formation and membrane breakages[12]. So the determination of the limiting current density and potential for the system is also performed. The limiting current density is the maximum current density (current per unit membrane area) that can be used without causing negative effects such as higher electrical resistance and lower current efficiency. At the limiting current density, the concentration of a cation or an anion at the surfaces of the cation exchange or anion-exchange membrane, as appropriate, in the cells with the depleted solution will be zero[12–14]. At and beyond the limiting current density, H + and OH ? generated upon dissociation of water transport a part of the electric current[15]. Artificial neural networks (ANNs) utilize interconnected mathematical nodes or neurons to form a network that can model complex functional relationships[16]. Its development started in the 1940s to help cognitive scientists to understand the complexity of the nervous system. It has been evolved steadily and was adopted in many areas of science. Basically, ANNs are numerical structures inspired by the learning process in the human brain. They are constructed and used as alternative mathematical tools to solve a diversity of problems in the fields of system identification, forecasting, pattern recognition, classification, process control and many others[17]. Artificial neural networks have been used in a wide range of membrane process applications (reverse osmosis, nanofiltration, ultrafiltration, microfiltration, membrane filtration, gas separation, membrane bioreactor and fuel cell) [18]. However, there are a few records in the literature which apply artificial neural networks for the prediction of SP in the electrodialysis process. One ANN which has received the most attention is back propagation network (BPN) [19]. BPNs have hierarchical feed forward networks frame. In the classical structure of BPNs, the outputs of each layer are sent directly to each neuron of the next layer. There are many layers, but at least three layers are considered: an input layer receives and distributes inputs, a middle or hidden layer captures the nonlinear relationships of inputs and outputs, and an output layer produces calculated data. BPNs also may contain a bias neuron that produces constant outputs, but receives no I inputs[20,21]. BP algorithm is based on minimization of errors in neural networks. The errors are described as difference between the desired outputs and the actual ones [22]. The training is completed when the precision of the training is met (Fig. 1). BPNs can be operated with better generalization and fault-tolerant capabilities, however, it has some shortcomings: (1) Slow astringency can lead to a longer training time. (2) Local extremum point may emerge in the training process. So in this study, improved BP algorithms were used to improve the prediction capability of separation percent in the electrodialysis process. In this paper, adaptive learning rate method and flexible BP algorithm were the methods of improved BP algorithms to be applied in the electrodialysis process. 2. Materials and methods 2.1. Experimental instruments and materials In the electrodialysis process, the experimental instruments and materials were shown in Table 1. Moreover, other experimental materials were measuring cylinders, beakers, deionized water and so on. The purpose of these experiments was to study the effects of feed concentration, temperature, voltage and flow rate on the electrodialysis cell performance. 2.2. Cell and membranes A plate and frame of electrodialysis cell which was made from polymethylmethacrylate (PMMA) was used to conduct the electrodialysis experiments (Fig. 2). The electrodialysis cell consisted of three parts and packed with a pair of CEM (cation exchange membrane) and AEM (anion exchange membrane) and a pair of electrodes. The overall dimensions of length, width and height of the electrodialysis cell were 0.191 m, 0.021 m and 0.181 m, respectively. The effective areas of the CEM and AEM were both 0.11 ×0.09 m 2 . Both electrodes were made of pure platinum. The surface area of each electrode was 0.115× 0.09 m 2 . And the volumes of dilute and concentrate compartments were 0.12 ×0.1 × 0.003 m3 and 0.12 × 0.1 ×0.006 m 3 , respectively. NaCl (sodium chloride) solution was entered into the three compartments of the cell. Cation-exchange (CEM) and anion-exchange (AEM) membranes were permeable to cationic and anionic species, respectively. The two membranes were immersed in parallel, and an electric current was passed through the solution. The cations migrated to the cathode, and the anions migrated to the anode. The feed solution was divided into two streams. One stream was diluted water, and the other was concentrated water. Under a certain flow rate, the electrodialysis cell could operate to remove electrolytic ions. The ions were electrolyzed in the electrodialysis cell, so the cathode and anode occurred reactions as follows: Anode reactions: 2Cl ? ?2e?Cl2? H2O?2e?1=2O2? þ2H þ Cathode reaction: 2H2Oþ2e?H2?þ2OH ? . II In the anode reaction, Cl2 and O2 were produced. Moreover, H2 was produced in the cathode reaction. These gases could increase the resistance of the electrodialysis cell, so two little holes were used to release gases on the plate of the electrodialysis cell. And the concentrated streams were disposed off to prevent accumulation of these gases. List the physical and chemical characteristics of the membranes (Table 2). 2.3. Electrodialysis setup Electrodialysis setup consisted of a tank of feed solution (TK-01), a DC power supply and two valves (GB-01, GB-02) for controlling flow rates of feed solution (Fig. 3). The total height of the electrodialysis process was 0.5 m. No cyclic regime was used for two concentrated streams and a diluted stream, and the diluted stream was collected for conductivities to predict and analysis. 2.4. Experimental principle The electrodialysis process was one of membrane separation technologies. Under the direct electric field, electrodialysis technology utilized the selectivity ability of ion-exchange membranes to separate electrolytes from the solution, in order to realize the purposes of dilution, concentration or purification of the solution (Fig. 4). 2.5. Determination of the limiting current densities The limiting current densities (LCDs) in the electrodialysis process are an important parameter which determines the electrical resistance and the current utilization. Usually, LCDs depend on membrane and solution properties as well as on the electrodialysis stack construction and various operational parameters such as the flow velocity of the diluted solution[23]. The methods of measurements of the limiting current density are voltage–current method, pH–current method and so on. In this paper, voltage–current method was used to determine the LCDs, the specific steps were as follows. In the condition of constant temperature, concentration, and flow rate, adjust the buttons of voltmeter, and record the groups of voltages and currents. When the voltages were low, the currents had a linear relationship with the voltages. And as the voltages increased, the currents changed slightly. The limiting currents were determined by inflection points, and the limiting current densities were also obtained. For example, T=35 ?C, C=0.5 g/L, Q=0.5 mL/s, as the voltages increased, the currents had a linear relationship with the voltages. The current was up to 0.51 A, the voltages had slight influence on the currents, so the limiting current, namely, the inflection point was 0.51 A (Fig. 5). The effective areas of the membranes were both 0.11×0.09 m 2 ,so LCD was 51.515 A/m 2 . In the experiments, the operating currents could not exceed the limiting current. Otherwise, polarization occurred in the electrodialysis process and the prediction of separation percent would be meaningless. Using voltage–current method, LCDs were obtained in all the experimental conditions (Table 3). To prevent the occurrence of polarization of the electrodialysis cell, all operating currents were controlled under the limiting currents. And in the electrodialysis experiments, the maximum limiting current was 0.80 A, and the effective areas of III ion-exchange membranes were 0.11 ×0.09 m 2 , so the maximum limiting current density was 80.808 A/m 2 . 2.6. Experimental data Experiments were carried out under the limiting current densities (Table 4). Four factors were studied: feed concentration (0.5 g/L, 1 g/L and 1.5 g/L),flow rate of dilute compartment (0.05 mL/s, 0.25 mL/s, 0.5 mL/s, 0.75 mL/s and 1 mL/s), reaction temperature (20 ?C, 25 ?C, 30 ?C, 35 ?C, 40 ?C and 45 ?C), and applied voltage (2 V, 5 V and 8 V). 3. BP neural networks and improved BP algorithms 3.1. BP neural networks A typical BP neural network was a full-connected neural network including an input layer, a hidden layer and an output layer[24,25].The goal of the training process was to adjust the weights. The networks training was an unconstrained nonlinear minimization issue[26].Some researchers claimed that the networks with a single hidden layer could approximate any continuous function to any desired accuracy[27–29]. BP neural networks were composed of two parts which were forward and error back propagation. In the forward propagation, the inputs spread from the input layer, after being processed by some hidden layers, then reaching the output layer, the predictive values of the outputs in the output layer were compared with the actual outputs and the differences between them were aggregated to generate the errors. In the error back propagation, when the errors were not in the ranges of errors, the errors were back propagated by adjusting the weights. The learning process continued until the errors converged to a targeted value (Fig. 6). 3.2. Construction of BP neural networks There were some precautions about BP neural networks: (1) Pretreat samples. Usually samples were not used directly for network training, but engaged in pretreatment for raw data. The experimental data contained some uncertain factors in BPN training. A preprocessing method was so necessary for preparing the training and testing data to enhance the reliability. (2) Optimize initial weights. Initial weights of the networks had impact on final training results, and affected the networks whether to achieve an acceptable accuracy or not. (3) Select a number of hidden layers and neurons of hidden layers. The selection of hidden layers and neurons of hidden layers which affected mapping capabilities of complex issues directly were the most critical step. Now the reliable algorithm was to start with a few hidden layers and a number of neurons, train and test, then increase their numbers. Comparing the results of different training and test samples, select more appropriate numbers of hidden layers and their neurons. In this study, three-layer neural networks which had different neurons of a hidden layer were applied in the electrodialysis experiments. (4) Choose training samples. The required samples of the networks depended on complex degrees of mapping relationships. Generally, the more complicated mapping relationships, the more training samples required. When choosing samples from all IV data, BPNs needed to obey the following principles: sufficient numbers, representative, and well-distributed. 3.3. Improved BP algorithms BPNs were based on solid theory and rigorous derivation, however, it was found that there were many shortcomings in the training process of BPNs, including slow convergence, the emergence of local extremum and so on. In the practical application, BP algorithm was difficult to be competent, so some improved BP algorithms were raised to enhance prediction capability. There were some methods about improved BP algorithms, such as, additional momentum method, adaptive learning rate method ,flexible BP algorithm and so on. In this paper, adaptive learning rate method and flexible BP algorithm were used to predict separation percent in the electrodialysis process and compared prediction capability with that of BPNs. 3.3.1. Adaptive learning rate method The training process of BP algorithm had a shortcoming of slow convergence which was affected by an inappropriate learning rate. In BP algorithm, the adjustments of weights depended on learning rates and gradients. And learning rate was constant in BP algorithm. In fact, when learning rate was lower, training time got longer and convergence became slower. When learning rate was too high, oscillation and divergence had emerged, this caused an unstable system. Adaptive learning rate was shown in Fig. 7. The basic principle of adaptive learning rate: when learning rate (η) was increased, this caused shortening learning time; the higher the learning rate, the harder the convergence, and in this condition, learning rate should be decreased until the convergence of the training process. Learning rate was adjusted by the changing of errors and gradients, and also by the gradients of the learning rate according to error function. Moreover, the changing of the total error may be proceeded by adjusting heuristically, the rules were as follows: (1) If the total error (E) decreased, learning rate needed to increase. (2) If the total error (E) increased, learning rate needed to decrease. When the ratio of a new error to the original error exceeded a certain value (e.g. 1.04), learning rate decreased rapidly. 3.3.2. Flexible BP algorithm Generally speaking, Sigmoid function was used to transfer data from the input layer to the hidden layer. Moreover, Sigmoid function kept infinite inputs within the limited realms of the outputs. When input variables were large, the slope of Sigmoid function would be close to 0. Even if the gradient changed slightly, this could induce the weights to make great changes, so the weights deviated gradually from the most optimistic value, and even made the weights of the networks cease in the amendment process. When the training process vibrated, variable quantities of the weights would be decreased. And when the changing direction of the weights is kept constant in several iterations, the variable quantities of the weights would be increased. Therefore, the convergence speed of flexible BP algorithm had an advantage over other improved BP algorithms. V 4. Results and discussion Four important aspects needed be determined in the design of BP neural networks and improved BP algorithm: 1) Selection of transfer functions of neurons 2) Selection of nodes of a hidden layer 3) Selection of increasing ratios of learning rates 4) Testing the generalization of BP neural networks and improved BP algorithm 4.1. Choice of transfer functions of neurons The functions were used to transfer between neurons of two layers that were adjacent with each other. And these transfer functions must be derivable. BP neural networks had one or more hidden layers, the neurons of hidden layers used Sigmoid transfer functions. The neurons of an output layer used Pureline transfer functions, and the outputs could be arbitrary values. If the transfer functions of the neurons of the output layer were Sigmoid transfer functions, the entire outputs of the networks were restricted in the range of [?1, 1]. Sigmoid transfer functions included log-sigmoid and tan-sigmoid functions. The inputs were mapped to (0, 1) and (?1, +1), respectively. Log-sigmoid function was a unipolar transfer function. Tan-sigmoid function was a bipolar transfer function and was called a hyperbolic tangent function (Fig. 8). In the practical applications, based on the relationships of inputs and outputs, a certain transfer function was used. Inputs contained no negative values, so log-sigmoid function had been adopted. If negative values were included, tag-sigmoid function had been used. In this paper, the neurons of the hidden layers used Sigmoid transfer functions, and the neurons of the output layer used Pureline transfer functions. 4.2. Choice of nodes of hidden layers Many scholars had engaged in studying the optimum nodes of hidden layers. Kolmogorov Theorem proved that as long as the nodes of a hidden layer were enough, a hidden layer of the neural networks could approach a nonlinear function with arbitrary precision. However, for a finite mapping from inputs to outputs, unlimited nodes of a hidden layer were not needed. And how to choose the nodes of the hidden layer was a problem that had not been solved yet. The nodes of the hidden layer were determined by experience and experiment design. Generally, based on reflecting on the relationships of inputs and outputs accurately, the small nodes of a hidden layer were chosen to keep the structure of the networks simple. But the smaller nodes, the worse generalization capability of the neural networks. On the contrary, if the nodes of the hidden layer were larger, the complexity degrees of training process were increased, and this condition caused over-fitting phenomenon. During the design process, many factors must be integrated. In the specific design, at first, a hidden layer was chosen. If increasing the nodes of the hidden layer could not acquire better networks, layer numbers and nodes of hidden layers needed to be added. In this paper, four influencing factors (voltage, concentration, temperature, flow rate) were nodes of an input layer, and separation percent was a node of an output layer. So the nodes of a hidden layer were between 4 and 12. The structures of the VI networks were 4:4:1, 4:10:1, and 4:12:1. Training by MATLAB software, performance graphs were shown in Fig. 9. From Fig. 9(A), the total training time was 74, the best validation performance ?3?3was above 10 , MSE (mean squared error) of training data was below 10 , and MSE of test samples and valid data were about 0.005. From Fig. 9(B), the total ?3training time was 33, the best validation performance was above 10 , MSE of ?3training data was below 10 , and MSE of test samples and valid data may be slightly ?3higher than 10 . From Fig. 9(C), the total training time was 101, the best validation ?3performance was above 10 , MSE of training data, test samples and valid data were ?3higher than 10 . From the three graphs, it was shown that 4:10:1 was the best structure of the networks, for it had the lowest value of MSE, and the shortest training time. 5. Conclusion BP neural networks and improved BP algorithms were employed as two prediction methods for separation percent of NaCl solution in the electrodialysis process, and improved BP algorithms were superior to BP neural networks. Improved BP algorithms remedied a defect of BP neural networks which had unsuitable learning rates and weights in the training process, and improved BP algorithms used the methods of changing with increasing ratios of learning rates and the weights. And flexible BP algorithm was a method of improved BP algorithms, and its prediction was obviously better than BP neural networks. It was discussed that in the condition of different training parameters (transfer functions of neurons, the number of neurons of a hidden layer and learning rates), the prediction capability of BP neural networks and improved BP algorithms were studied. The optimum training parameters were obtained. In this paper, the neurons of the hidden layers used Sigmoid transfer functions, and the neurons of the output layers used Pureline transfer functions. 4:10:1 network was the best structure of the networks, so the optimum nodes of a hidden layer were 10. 1.05 was used as the best increasing ratio of learning rate to train data. However, the optimum training parameters were their limited values, due to the localization of the experimental cell and polarization. Concentration, flow rate, temperature and voltage had nonlinear relationships with separation percent, and temperature and voltage had positive correlation relationships with separation percent, and concentration and flow rate had negative correlation relationships with separation percent. For nonlinear relationships, improved BP algorithms had better prediction. It was shown that improved BP algorithms had the generalization, efficiency and adaptive abilities for complex VII groups of data, and made attractive choices for modeling of complex systems, such as water treatment processes and membrane technologies. VIII
/
本文档为【用BP神经网络算法和改进的BP神经网络算法研究预测电渗析过程的分离百分比-外文翻译】,请使用软件OFFICE或WPS软件打开。作品中的文字与图均可以修改和编辑, 图片更改请在作品中右键图片并更换,文字修改请直接点击文字进行修改,也可以新增和删除文档中的内容。
[版权声明] 本站所有资料为用户分享产生,若发现您的权利被侵害,请联系客服邮件isharekefu@iask.cn,我们尽快处理。 本作品所展示的图片、画像、字体、音乐的版权可能需版权方额外授权,请谨慎使用。 网站提供的党政主题相关内容(国旗、国徽、党徽..)目的在于配合国家政策宣传,仅限个人学习分享使用,禁止用于任何广告和商用目的。

历史搜索

    清空历史搜索