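· Agricultural Bio-environmental and Energy Engineering ·
Rapid determination of volatile fatty acids in biogas slurry based on near infrared spectroscopy
Liu Jinming1,2,3, Guo Kunlin2, Zhen Feng1,3, Zhang Hongqiong1,4, Li Wenzhe1,4, Xu Yonghua5※
(1. College of Engineering, Northeast Agricultural University, Harbin 150030, China; 2. College of Electrical and Information, Heilongjiang Bayi Agricultural University, Daqing 163319, China; 3. Key Laboratory of Renewable Energy, Chinese Academy of Sciences, Guangzhou 510640, China; 4. Heilongjiang Key Laboratory of Technology and Equipment for Utilization of Agricultural Renewable Resources in Cold Region, Harbin 150030, China; 5. College of Electrical and Information, Northeast Agricultural University, Harbin 150030, China)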
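Volatile fatty acids (VFA) are important intermediate products of anaerobic fermentation, and their accumulation in an anaerobic reactor indicates inactive methanogens or deteriorating fermentation conditions. To enable process analysis and state monitoring of the anaerobic fermentation of agricultural and livestock wastes, near infrared spectroscopy (NIRS) was combined with partial least squares (PLS) to build rapid detection models for the acetic acid, propionic acid and total acid contents of the fermentation liquid of corn stover and livestock manure. Competitive adaptive reweighted sampling (CARS) was combined with the genetic simulated annealing (GSA) algorithm to form the CARS-GSA algorithm, which was used to select characteristic wavelengths for acetic acid, propionic acid and total acid in biogas slurry. After preprocessing and wavelength selection, the 1 557 wavelength points of the raw spectra were reduced to 135, 101 and 245 characteristic wavelength variables for acetic acid, propionic acid and total acid, respectively. The resulting regression models had coefficients of determination for validation of 0.988, 0.923 and 0.886, root mean squared errors of prediction (RMSEP) of 0.111, 0.120 and 0.727, and residual predictive deviations (RPD) of 9.685, 3.685 and 3.484, respectively. Compared with full-spectrum modeling, RMSEP decreased by 17.78%, 15.49% and 1.22%, respectively. The models meet the requirements for rapid detection of acetic acid and propionic acid in the fermentation liquid during anaerobic fermentation of agricultural and livestock wastes, and basically meet the requirement for total acid. The results show that selecting the sensitive wavelength variables of acetic acid, propionic acid and total acid with the CARS-GSA algorithm markedly reduced the number of wavelength points used in modeling, lowered the variable dimension and model complexity, and improved the detection accuracy and predictive ability of the regression models, providing a new approach for the rapid and accurate detection of VFA in biogas slurry.
anaerobic fermentation; volatile fatty acids; rapid determination; near infrared spectroscopy; partial least squares; genetic simulated annealing algorithm; competitive adaptive reweighted sampling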
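Volatile fatty acids (VFA), as important intermediate products of anaerobic fermentation, provide the substrate for the methanogenic stage[1]. Methanogens mainly use VFA to form methane. Only a small part of the methane is produced from carbon dioxide and hydrogen, and even that route passes through the intermediate step in which macromolecular organic matter is converted to VFA[2]. VFA accumulation in an anaerobic reactor indicates inactive methanogens or deteriorating fermentation conditions. A high VFA concentration inhibits methanogens, and an excessive one can even cause the fermentation to "sour"[3]. During reactor operation, the VFA concentration of the fermentation liquid is commonly used as an important monitoring indicator of the process[4]. Monitoring changes in VFA gives a good picture of the degradation of organic matter, the activity of methanogens and the operating state of the system[5]. To monitor the state of anaerobic fermentation effectively, the VFA content of the fermentation liquid must be determined quickly and accurately.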
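Traditional VFA detection methods mainly include distillation, high performance liquid chromatography, gas chromatography and various titration techniques[5-6]. However, they involve long processing times and complicated equipment operation, so they cannot deliver the rapid VFA measurements that state monitoring of anaerobic fermentation requires. To meet this need, researchers have studied rapid titration[7], new chromatographic techniques[8], electrochemical sensors[9], biosensors[10] and spectroscopic analysis[11] for rapid VFA detection. Because spectroscopic analysis is simple, fast, non-destructive and low-cost, it has been widely applied to VFA detection in fermentation liquid[12-13], with near infrared spectroscopy (NIRS) quantitative analysis the most widely used[14-16]. Work on rapid NIRS detection of VFA in aqueous media has focused mainly on spectral preprocessing and multivariate quantitative calibration methods. The selection of VFA characteristic wavelengths needs further study in order to eliminate the influence of irrelevant and nonlinear wavelength points on model accuracy.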
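At present, NIRS characteristic wavelength selection is moving toward combinations of several selection methods. Combining wavelength selection algorithms such as interval partial least squares (iPLS)[17], synergy interval partial least squares (SiPLS)[18], backward interval partial least squares (BiPLS)[19], the successive projections algorithm[20] and competitive adaptive reweighted sampling (CARS)[21] with intelligent optimization algorithms such as the genetic algorithm (GA)[22], the simulated annealing algorithm (SA)[23] and particle swarm optimization[24] has become an important research direction[25-27]. In NIRS quantitative analysis, GA has been widely used for its strong wavelength selection ability[28-29], but it suffers from premature convergence and low search efficiency in the late stage of evolution.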
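The genetic simulated annealing (GSA) algorithm is an improvement on GA. It uses the SA temperature parameter in the design of the fitness function and introduces a Metropolis selection-and-copy strategy for perturbed solutions, which retains the strong search ability of GA while addressing these two shortcomings, and it has performed well in NIRS characteristic wavelength selection[30]. GSA also clearly outperforms GA when combined with iPLS, SiPLS and BiPLS for wavelength selection[30-32]. However, when iPLS, SiPLS and BiPLS are used to select characteristic spectral regions, redundant wavelength points inside the regions are hard to avoid. These irrelevant and nonlinear redundant points make the GSA encoding too long and seriously degrade the performance of the secondary GSA search for characteristic wavelength points.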
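Therefore, to meet the need for rapid VFA detection in biogas slurry during anaerobic fermentation of straw and manure, this paper proposes a rapid VFA detection model based on NIRS and combines GSA with CARS to form the CARS-GSA algorithm for VFA characteristic wavelength selection. This avoids the redundant wavelength points that remain when iPLS, SiPLS and BiPLS are used for the preliminary location of sensitive bands, and yields effective characteristic wavelength variables that satisfy practical detection requirements, improving the efficiency and accuracy of the rapid VFA detection model.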
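The corn stover was collected from the experimental field of Northeast Agricultural University, the pig manure from Harbin Sanyuan Animal Products Industry Company, the cow manure from the Harbin Yufeng Dairy Farming Farmers' Professional Cooperative, and the inoculum from the Heilongjiang Key Laboratory of Technology and Equipment for Utilization of Agricultural Renewable Resources in Cold Region. After natural air drying, part of the corn stover was cut into 10 mm segments with a chaff cutter, and the rest was ground into stover powder with a hammer mill (10 mm screen). Stover segments, stover powder, cow manure, pig manure and a mixture of stover powder and pig manure (1:1 on a total solid (TS) basis) were used as feedstocks. Cow manure anaerobic fermentation liquid that had been acclimated year-round in a 500 L laboratory fermenter and was producing gas normally served as the inoculum for batch anaerobic fermentation tests. The TS contents of the stover, cow manure, pig manure and inoculum were 86.02%, 26.62%, 31.22% and 4.76%, respectively. With a TS inoculation ratio of 1:1, the amounts of feedstock and inoculum were adjusted so that the initial TS contents of the fermentation systems for the five feedstocks were 7%, 6%, 8%, 7% and 7%, respectively. Two batches of tests were run in a mesophilic (36±1) ℃ thermostatic water bath, using 5 and 10 L aspirator bottles as reactors with effective fermentation volumes of 3.5 and 7 L, respectively. The reactors were shaken by hand twice a day at fixed times to mix the slurry and prevent scum from crusting. To obtain representative VFA concentration samples, sampling took place mainly during the first half of the batch fermentation. For the 5 L reactors, sampling began on day 2 after loading: 40 mL of fermentation liquid was collected at 8:00 each day and stored in three 15 mL centrifuge tubes, for 16 samplings in total. To keep the TS content of the slurry from rising and harming the fermentation, 300 mL of water was added on day 8. For the 10 L reactors, sampling also began on day 2 after loading, for 15 samplings in total, and no water was added. In all, 155 fermentation liquid samples were collected and prepared, and they were stored frozen at -20 ℃.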
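The loading calculation implied by the TS figures above can be sketched as a simple mass balance. This is only an illustration: it assumes the slurry density is about 1 g/mL (so a 3.5 L working volume is treated as 3 500 g), that water makes up the balance, and that the 7% target belongs to the stover segments. The function name `batch_recipe` is hypothetical.

```python
def batch_recipe(ts_feed, ts_inoc, ts_target, total_mass_g, feed_to_inoc_ts=1.0):
    """Return wet masses (feedstock, inoculum, water) in grams.

    ts_feed, ts_inoc, ts_target are total-solid fractions between 0 and 1.
    feed_to_inoc_ts is the TS inoculation ratio (feedstock TS : inoculum TS).
    Assumption: the working volume is treated as total_mass_g of slurry.
    """
    total_ts = ts_target * total_mass_g
    ts_from_feed = total_ts * feed_to_inoc_ts / (1.0 + feed_to_inoc_ts)
    ts_from_inoc = total_ts - ts_from_feed
    feed = ts_from_feed / ts_feed
    inoc = ts_from_inoc / ts_inoc
    water = total_mass_g - feed - inoc
    if water < 0:
        raise ValueError("target TS cannot be reached by adding water only")
    return feed, inoc, water


# Stover (TS 86.02%) with inoculum (TS 4.76%), target 7% TS in 3 500 g of slurry
feed, inoc, water = batch_recipe(0.8602, 0.0476, 0.07, 3500.0)
print(f"stover {feed:.1f} g, inoculum {inoc:.1f} g, water {water:.1f} g")
```

Under these assumptions the stover and the inoculum each contribute 122.5 g of TS, and the remaining mass is made up with water.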
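After thawing, the frozen fermentation liquid samples were centrifuged at 12 000 r/min for 10 min in a refrigerated centrifuge, and the supernatant was taken for measurement. Transmission spectra were scanned with an Antaris II Fourier transform near infrared spectrometer (Nicolet). The spectral range was 4 000~10 000 cm⁻¹ (1 000~2 500 nm), the resolution was 8.0 cm⁻¹, each sample was scanned 32 times, data were saved as lg(1/T), the background was scanned once per hour, and samples were loaded in a quartz cuvette with a 1 mm optical path and scanned in the front channel. With room temperature and humidity kept essentially stable, each sample was loaded three times and the mean of the three scans was taken as its raw spectrum. The raw spectrum contains 1 557 wavelength points with a data spacing of 3.86 cm⁻¹, starting at 10 001.03 cm⁻¹ and ending at 3 999.64 cm⁻¹.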
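The spectral axis described above can be rebuilt from the three stated numbers (1 557 points, start 10 001.03 cm⁻¹, end 3 999.64 cm⁻¹). The short sketch below assumes the points are evenly spaced in wavenumber and uses the standard conversion between wavenumber (cm⁻¹) and wavelength (nm). Variable and function names are illustrative.

```python
N_POINTS = 1557
START_CM1 = 10001.03   # first wavenumber in the raw spectrum
END_CM1 = 3999.64      # last wavenumber in the raw spectrum

# Even spacing is an assumption; it reproduces the stated 3.86 cm^-1 interval.
step = (START_CM1 - END_CM1) / (N_POINTS - 1)
wavenumbers = [START_CM1 - i * step for i in range(N_POINTS)]


def to_nanometres(wavenumber_cm1):
    """Convert a wavenumber in cm^-1 to a wavelength in nm."""
    return 1.0e7 / wavenumber_cm1


print(round(step, 2))                      # spacing in cm^-1
print(to_nanometres(4000.0), to_nanometres(10000.0))
```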
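VFA concentrations in the biogas slurry during anaerobic fermentation were measured with an Agilent GC-6890N gas chromatograph. Standard curves were established by the external standard method: a mixed standard solution of acetic, propionic, butyric, isobutyric and isovaleric acids was prepared and diluted with deionized water to six concentrations, and the retention time and integrated peak area of each component were measured at each concentration. The retention times in the mixed solution were compared with those of the single standards, and the standard curves were plotted from the known concentration of each substance in the standard solutions and the corresponding integrated peak areas. VFA content was measured on the supernatant of the fermentation liquid samples after thawing, centrifugation and acquisition of the transmission spectra. The supernatant was mixed with 25% metaphosphoric acid solution at a volume ratio of 10:1 and centrifuged again at 12 000 r/min for 10 min. The resulting supernatant was passed through a 0.45 μm filter membrane, and the filtrate was used for VFA determination.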
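An external standard curve of this kind is usually a straight-line fit of peak area against concentration, inverted to read a sample concentration from its peak area. The sketch below uses made-up standard concentrations and peak areas (not data from this study), hypothetical units of g/L, and a dilution factor of 1.1 that assumes the 10:1 mixing volumes are additive.

```python
def fit_line(x, y):
    """Ordinary least-squares line y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Hypothetical six-level standard series (g/L) and peak areas (arbitrary units)
std_conc = [0.5, 1.0, 2.0, 4.0, 6.0, 8.0]
std_area = [120.0 * c + 5.0 for c in std_conc]

slope, intercept = fit_line(std_conc, std_area)


def area_to_conc(area, dilution=1.0):
    """Invert the standard curve; dilution rescales to the undiluted sample."""
    return (area - intercept) / slope * dilution


# 10 parts sample + 1 part metaphosphoric acid -> dilution factor 11/10
print(area_to_conc(245.0, dilution=1.1))
```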
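1.4.1 CARS algorithm
Following the principle of "survival of the fittest", the CARS algorithm combines Monte Carlo sampling (MCS), an exponentially decreasing function and adaptive reweighted sampling (ARS) to obtain wavelength subsets. It generates a series of variable combinations according to the absolute values of the partial least squares (PLS) regression coefficients and selects the subset with the smallest root mean squared error of cross validation (RMSECV) as the characteristic wavelengths. Because MCS and ARS introduce two random factors into the iterations, CARS cannot guarantee the same selection result in every run. One remedy is to run CARS many times: wavelength points that are selected repeatedly across runs are the ones most strongly correlated with the target property, and using them as characteristic wavelengths can produce a high-performance regression model.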
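The exponentially decreasing function mentioned above controls how many variables survive each CARS iteration. A minimal sketch follows, using the form commonly given for CARS, r_i = a·exp(-k·i), with a and k chosen so that all variables are kept in the first iteration and two remain in the last. The iteration count of 50 is an assumption for illustration, since the text does not state it.

```python
import math


def cars_retained_counts(n_vars, n_iter):
    """Number of variables kept at iterations 1..n_iter by the decay function."""
    a = (n_vars / 2.0) ** (1.0 / (n_iter - 1))
    k = math.log(n_vars / 2.0) / (n_iter - 1)
    return [max(2, round(n_vars * a * math.exp(-k * i)))
            for i in range(1, n_iter + 1)]


# 1 462 valid wavelength points; 50 iterations is a hypothetical setting
counts = cars_retained_counts(1462, 50)
print(counts[:5], counts[-5:])
```

The steep early drop followed by a slow tail is what the method describes as fast, then refined, elimination of variables.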
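1.4.2 CARS-GSA algorithm
CARS-GSA takes the characteristic wavelengths selected by CARS as input and uses GSA to re-optimize them, removing weakly correlated wavelength points from the CARS result and thereby further improving modeling performance. The number of wavelength points selected by CARS is used as the code length, the K-fold cross-validation RMSECV of the PLS regression model is used as the objective function, and binary encoding and population initialization are carried out with an initial population size of about one third of the code length. "1" and "0" indicate that the data at the corresponding wavelength point are or are not selected for the calculation. After the initial temperature and the cooling operation have been set and the fitness values computed, several rounds of GSA selection, crossover, mutation and Metropolis selection-and-copy are executed to complete the selection of NIRS characteristic wavelength points. GSA is run repeatedly on the CARS result, and the wavelength points that are selected repeatedly are used as the characteristic wavelength variables of the PLS regression model, which gives good model performance.
All algorithms in this paper, including spectral preprocessing, sample set partitioning, characteristic wavelength selection and regression model construction, were implemented in Matlab R2012b.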
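Two building blocks of the procedure above, the binary encoding and the Metropolis acceptance rule, can be sketched as follows. The helper names are hypothetical, the sketch is written in Python rather than the Matlab used in the study, and the acceptance rule shown is the textbook Metropolis criterion for a cost to be minimized, which may differ in detail from the authors' implementation.

```python
import math
import random


def decode(chromosome, candidate_wavenumbers):
    """Return the wavenumbers whose bit is 1 (selected for modeling)."""
    return [w for bit, w in zip(chromosome, candidate_wavenumbers) if bit == 1]


def perturb(chromosome, n_flips, rng):
    """Neighbourhood solution: flip n_flips randomly chosen bits."""
    neighbour = list(chromosome)
    for idx in rng.sample(range(len(neighbour)), n_flips):
        neighbour[idx] = 1 - neighbour[idx]
    return neighbour


def metropolis_accept(old_cost, new_cost, temperature, rng):
    """Always accept an improvement; accept a worse solution with
    probability exp(-(new_cost - old_cost) / temperature)."""
    if new_cost <= old_cost:
        return True
    return rng.random() < math.exp(-(new_cost - old_cost) / temperature)


rng = random.Random(0)
mask = [1, 0, 1, 1, 0]
print(decode(mask, [4416.19, 4531.90, 4925.30, 5311.00, 7000.00]))
print(perturb(mask, 2, rng))
```

At high temperature worse solutions are accepted often, which helps the search escape local optima; as the temperature falls the rule approaches plain greedy acceptance.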
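When the VFA concentrations of the 155 fermentation liquid samples were measured with the Agilent GC-6890N gas chromatograph, 81 valid acetic acid values, 78 valid propionic acid values and 87 valid total acid values were obtained (total acid is the sum of the mass fractions of acetic, propionic, butyric, isobutyric and isovaleric acids). The valid concentration data were analysed by quartiles and plotted as the box plot in Fig. 1.
Fig. 1 Box plot of sample VFA concentrations
Fig. 1 shows that a large share of the acetic acid samples lie in the low-concentration region, the propionic acid samples are slightly skewed toward low concentrations, and the total acid samples are fairly evenly distributed. The acetic acid samples are strongly skewed because, during the period in which acetogenesis and methanogenesis are in balance, methanogens promptly convert the acetic acid produced into methane and carbon dioxide, which keeps the acetic acid concentration low during this period (a large share of the whole fermentation cycle).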
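The quartile analysis behind a box plot can be reproduced with the standard library. The sketch below uses invented concentration values purely to show the calculation; the 1.5 × IQR whisker rule is the common box-plot convention and is an assumption about how Fig. 1 was drawn.

```python
import statistics


def box_stats(values):
    """Quartiles, whisker ends and outliers under the 1.5 * IQR convention."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low_fence, high_fence = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    inside = [v for v in values if low_fence <= v <= high_fence]
    return {
        "q1": q1,
        "median": median,
        "q3": q3,
        "whisker_low": min(inside),
        "whisker_high": max(inside),
        "outliers": sorted(v for v in values if v < low_fence or v > high_fence),
    }


# Hypothetical, right-skewed concentrations (not data from this study)
demo = [1, 2, 3, 4, 5, 6, 7, 8, 9, 50]
print(box_stats(demo))
```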
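To eliminate the influence of flat-topped (saturated) peaks on the modeling results, the 95 wavelength points between 4 933.02 and 5 295.57 cm⁻¹ were first removed from the raw spectra. The remaining 1 462 valid wavelength points were used to build regression models for acetic acid, propionic acid and total acid, and model performance was evaluated under different spectral preprocessing methods. After comparison, multiplicative scatter correction followed by Savitzky-Golay smoothing (MSC+SG) was chosen for the acetic acid model, SG followed by MSC (SG+MSC) for the propionic acid model, and first derivative, standard normal variate and SG smoothing (FD+SNV+SG) for the total acid model. The raw sample spectra and the mean preprocessed spectra for acetic acid, propionic acid and total acid are shown in Fig. 2.
Fig. 2 Spectral data of samples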
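The band removal and two of the preprocessing steps named above can be sketched in a few lines. This is an illustrative Python version, not the authors' Matlab code: `drop_band`, `snv` and `msc` are hypothetical names, the 0.5 cm⁻¹ tolerance guards against rounding of the band limits, MSC uses the mean spectrum as reference, and SG smoothing and the first derivative are omitted for brevity.

```python
import statistics


def drop_band(wavenumbers, spectrum, low, high, tol=0.5):
    """Remove points whose wavenumber lies in [low, high] (with a tolerance)."""
    kept = [(w, a) for w, a in zip(wavenumbers, spectrum)
            if not (low - tol <= w <= high + tol)]
    return [w for w, _ in kept], [a for _, a in kept]


def snv(spectrum):
    """Standard normal variate: centre and scale one spectrum."""
    mean = statistics.fmean(spectrum)
    sd = statistics.stdev(spectrum)
    return [(a - mean) / sd for a in spectrum]


def msc(spectra):
    """Multiplicative scatter correction against the mean spectrum."""
    n = len(spectra)
    ref = [sum(col) / n for col in zip(*spectra)]
    ref_mean = statistics.fmean(ref)
    sxx = sum((r - ref_mean) ** 2 for r in ref)
    corrected = []
    for s in spectra:
        s_mean = statistics.fmean(s)
        slope = sum((r - ref_mean) * (a - s_mean) for r, a in zip(ref, s)) / sxx
        offset = s_mean - slope * ref_mean
        corrected.append([(a - offset) / slope for a in s])
    return corrected


# Rebuild the 1 557-point axis and cut the saturated band
step = (10001.03 - 3999.64) / 1556
axis = [10001.03 - i * step for i in range(1557)]
kept_axis, _ = drop_band(axis, [0.0] * 1557, 4933.02, 5295.57)
print(len(axis) - len(kept_axis), len(kept_axis))
```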
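The raw spectra of the 81 acetic acid samples were processed with MSC and then SG smoothing, and divided by the SPXY method (sample set partitioning based on joint x-y distances) into 60 calibration samples and 21 validation samples. The raw spectra of the 78 propionic acid samples were processed with SG smoothing and then MSC, and divided by SPXY into 60 calibration samples and 18 validation samples. The raw spectra of the 87 total acid samples were processed with FD, SNV and SG smoothing in turn, and divided by SPXY into 70 calibration samples and 17 validation samples. The acetic acid, propionic acid and total acid concentrations are listed in Table 1.
Table 1 VFA concentrations of samples
Note: SD is short for standard deviation, NS is short for number of samples.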
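The SPXY partition can be sketched as a Kennard-Stone style selection on a joint distance that adds the normalized spectral distance and the normalized concentration distance. The version below is a minimal illustration on toy data, assuming at least two samples differ in both X and y. It is not the code used in the study.

```python
import math


def spxy_split(X, y, n_cal):
    """Return (calibration_indices, validation_indices) chosen by SPXY."""
    n = len(X)
    dx = [[math.dist(X[i], X[j]) for j in range(n)] for i in range(n)]
    dy = [[abs(y[i] - y[j]) for j in range(n)] for i in range(n)]
    max_dx = max(max(row) for row in dx)
    max_dy = max(max(row) for row in dy)
    # Joint distance: both parts scaled to [0, 1] so they weigh equally
    d = [[dx[i][j] / max_dx + dy[i][j] / max_dy for j in range(n)]
         for i in range(n)]
    # Start from the two samples that are farthest apart
    first, second = max(((i, j) for i in range(n) for j in range(i + 1, n)),
                        key=lambda pair: d[pair[0]][pair[1]])
    selected = [first, second]
    remaining = [i for i in range(n) if i not in selected]
    # Repeatedly add the sample farthest from its nearest selected sample
    while len(selected) < n_cal:
        best = max(remaining, key=lambda i: min(d[i][s] for s in selected))
        selected.append(best)
        remaining.remove(best)
    return selected, remaining


# Toy data: one-variable "spectra" and their reference values
X = [[0.0], [1.0], [2.0], [3.0], [10.0]]
y = [0.0, 1.0, 2.0, 3.0, 10.0]
cal, val = spxy_split(X, y, n_cal=3)
print(cal, val)
```

Because the extremes are picked first, the calibration set spans the range of both spectra and concentrations, which is the reason the method is preferred over random splitting for small sample sets.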
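2.2.1 Characteristic wavelength selection by CARS
To select the characteristic wavelengths of the acetic acid regression model with CARS, the CARS algorithm was first run 500 times. Then, stepping through increasing repeat-selection counts, the wavelength set with the smallest RMSEP was taken as the CARS characteristic wavelengths (denoted CARS500). The 500 runs yielded 383 acetic acid characteristic wavelengths in total (expressed in wavenumber, the same below). The most frequently selected one was 4 416.19 cm⁻¹, which corresponds to the combination band of the acetic acid -CH3 group and was selected 457 times. The frequently selected wavelength points are mainly located in the regions 4 000~4 600, 4 750~4 930, 5 300~5 500, 5 750~6 050, 6 750~7 100 and 7 500~7 800 cm⁻¹. Of these, 4 000~4 600 cm⁻¹ corresponds to the combination band of the acetic acid -CH3 group, 4 750~4 930 cm⁻¹ to the combination band of the C=O and -OH groups, 5 300~5 500 cm⁻¹ to the first overtone of the -COOH group, 5 750~6 050 cm⁻¹ to the first overtone of the -CH3 group, 6 750~7 100 cm⁻¹ to the second overtone of the C=O and -OH groups, and 7 500~7 800 cm⁻¹ to the second overtone of the -CH3 group. The characteristic wavelengths selected by CARS500 and the mean acetic acid spectrum are shown in Fig. 3.
Fig. 3 Characteristic wavelengths of acetic acid selected by CARS500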
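Counting how often each wavelength is chosen across repeated CARS runs is a simple tally. The sketch below uses a handful of invented runs to show the bookkeeping. In the study the tally would cover 500 runs.

```python
from collections import Counter


def selection_frequency(runs):
    """Count, for each wavenumber, the number of runs that selected it."""
    counts = Counter()
    for selected in runs:
        counts.update(set(selected))   # a run counts at most once per point
    return counts


def keep_at_least(counts, min_count):
    """Wavenumbers selected in at least min_count runs, in ascending order."""
    return sorted(w for w, c in counts.items() if c >= min_count)


# Hypothetical output of three CARS runs
runs = [
    [4416.19, 4925.30, 5311.00],
    [4416.19, 4925.30],
    [4416.19, 7000.00],
]
counts = selection_frequency(runs)
print(counts.most_common(1))
print(keep_at_least(counts, 2))
```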
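To analyse the modeling performance of the CARS500 wavelengths at different repeat-selection counts, the relationships of RMSECV, RMSEP and the number of wavelength points with the repeat-selection count were established, as shown in Fig. 4.
As Fig. 4 shows, as the number of selected wavelength points decreases, RMSECV first falls rapidly, then fluctuates in a wave-like manner, and finally rises quickly in jumps. RMSECV reaches its minimum of 0.163 at 120 wavelength points, corresponding to a repeat-selection count of 39. RMSEP varies in a sawtooth pattern and gradually increases as the number of selected wavelength points decreases. The PLS regression model reaches its minimum RMSEP of 0.116 at a repeat-selection count of 30, with 142 selected wavelengths. These 142 wavelength points, corresponding to the minimum RMSEP, were taken as the acetic acid characteristic wavelengths selected by CARS500.
Fig. 4 Relationship among RMSE, number of wavelengths and selection count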
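The threshold scan described above can be sketched as a loop over repeat-selection counts that keeps the subset with the lowest error. The `evaluate` callback stands in for building a PLS model and computing its RMSEP. It is a placeholder supplied by the caller, and the toy error function in the example is invented.

```python
def best_threshold(counts, evaluate):
    """Scan repeat-selection thresholds and return (threshold, subset, error)
    for the subset with the lowest error; ties keep the lowest threshold."""
    best = None
    for threshold in range(1, max(counts.values()) + 1):
        subset = sorted(w for w, c in counts.items() if c >= threshold)
        if not subset:
            break
        error = evaluate(subset)
        if best is None or error < best[2]:
            best = (threshold, subset, error)
    return best


# Hypothetical selection counts and a toy error that favours two wavelengths
counts = {4416.19: 5, 4925.30: 3, 7000.00: 1}
result = best_threshold(counts, evaluate=lambda subset: abs(len(subset) - 2))
print(result)
```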
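2.2.2 Characteristic wavelength selection by CARS-GSA
To select the acetic acid characteristic wavelengths of the fermentation liquid with CARS-GSA, 50 chromosomes were generated at random to form the initial population, with the 142 wavelength points selected by CARS500 as the code length, and GSA was run for the secondary selection of characteristic wavelength points. The GSA parameters were: initial temperature determination coefficient 100, cooling coefficient 0.9, 100 generations, crossover probability 0.7, mutation probability 0.01, and 10 perturbed bits for neighbourhood solutions. The algorithm was run 50 times in succession, and 14 of the selected acetic acid wavelengths were chosen more than 35 times. Among them, 4 057.49, 4 319.77, 4 354.48, 4 358.33, 4 362.19, 4 366.05, 4 408.48, 4 412.33, 4 416.19, 4 420.05, 4 531.90 and 4 539.61 cm⁻¹ correspond to the combination band of the -CH3 group, 4 925.30 cm⁻¹ corresponds to the combination band of the C=O group, and 5 311.00 cm⁻¹ corresponds to the first overtone of the -COOH group. The characteristic wavelengths selected by CARS-GSA and the mean acetic acid spectrum are shown in Fig. 5.
Fig. 5 Characteristic wavelengths of acetic acid selected by CARS-GSA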
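A compact sketch of one GSA run with the parameter values quoted above is given below. Several details are assumptions rather than the authors' implementation: the initial temperature is taken as the coefficient times the cost spread of the initial population, fitness is exp(-(cost - min cost)/T), selection is fitness-proportional, crossover is single-point, and the cost function is a toy stand-in for the PLS cross-validation error.

```python
import math
import random


def gsa_select(cost, n_bits, pop_size=50, generations=100, p_cross=0.7,
               p_mut=0.01, cooling=0.9, t0_coeff=100.0, n_flips=10, seed=0):
    """Minimize cost(chromosome) over binary masks; returns (mask, cost)."""
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(n_bits)] for _ in range(pop_size)]
    costs = [cost(c) for c in pop]
    # Assumed rule: T0 = coefficient * (worst - best) of the initial costs
    temp = t0_coeff * (max(costs) - min(costs)) or 1.0
    best_cost, best = min(zip(costs, pop))
    for _ in range(generations):
        c_min = min(costs)
        fitness = [math.exp(-(c - c_min) / temp) for c in costs]
        parents = rng.choices(pop, weights=fitness, k=pop_size)
        children = []
        for a, b in zip(parents[0::2], parents[1::2]):
            a, b = list(a), list(b)
            if rng.random() < p_cross:          # single-point crossover
                cut = rng.randrange(1, n_bits)
                a[cut:], b[cut:] = b[cut:], a[cut:]
            children += [a, b]
        for child in children:                  # bit-flip mutation
            for i in range(n_bits):
                if rng.random() < p_mut:
                    child[i] = 1 - child[i]
        new_pop, new_costs = [], []
        for child in children:                  # Metropolis selection-and-copy
            c_old = cost(child)
            neighbour = list(child)
            for i in rng.sample(range(n_bits), n_flips):
                neighbour[i] = 1 - neighbour[i]
            c_new = cost(neighbour)
            if c_new <= c_old or rng.random() < math.exp(-(c_new - c_old) / temp):
                child, c_old = neighbour, c_new
            new_pop.append(child)
            new_costs.append(c_old)
        pop, costs = new_pop, new_costs
        i_best = min(range(pop_size), key=costs.__getitem__)
        if costs[i_best] < best_cost:
            best_cost, best = costs[i_best], list(pop[i_best])
        temp *= cooling                         # cooling step
    return best, best_cost


# Toy cost: Hamming distance to a hidden mask (stand-in for PLS RMSECV)
target = [1 if i % 3 == 0 else 0 for i in range(142)]
toy_cost = lambda mask: sum(b != t for b, t in zip(mask, target))
mask, value = gsa_select(toy_cost, n_bits=142)
print(sum(mask), value)
```

Running the function with different seeds and tallying the selected bits mirrors the repeated-run strategy used in the text.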
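To analyse the modeling performance of the CARS-GSA wavelengths, the relationship of RMSECV and RMSEP with the number of wavelength points was established, as shown in Fig. 6. As Fig. 6 shows, as the number of selected wavelength points increases, RMSECV and RMSEP first fall rapidly, then level off, and finally rise slightly, but the minimum of RMSECV occurs earlier than that of RMSEP. The RMSECV minimum corresponds to 54 wavelength points and a repeat-selection count of 26, while the RMSEP minimum corresponds to 135 wavelength points and a repeat-selection count of 10. This indicates that choosing the characteristic wavelengths by minimum RMSECV alone can easily lead to overfitting of the regression model. Therefore, the 135 wavelengths corresponding to the minimum RMSEP were taken as the acetic acid characteristic wavelengths selected by CARS-GSA. Comparing the minimum RMSECV and RMSEP values in Fig. 4 and Fig. 6 shows that the wavelengths selected by CARS-GSA give better modeling performance than those selected by CARS500.
Fig. 6 Relationship between RMSE and number of wavelengths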
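RMSECV, the quantity compared with RMSEP above, is the root mean squared error pooled over held-out folds. The sketch below computes it for any model supplied as `fit` and `predict` callbacks. A one-variable straight-line fit stands in for PLS, and the fold count of 5 and the interleaved fold assignment are assumptions.

```python
import math


def kfold_rmsecv(x, y, fit, predict, k=5):
    """Pooled root mean squared error over k interleaved held-out folds."""
    n = len(y)
    squared_error = 0.0
    for fold in range(k):
        held_out = set(range(fold, n, k))
        train = [i for i in range(n) if i not in held_out]
        model = fit([x[i] for i in train], [y[i] for i in train])
        for i in held_out:
            squared_error += (predict(model, x[i]) - y[i]) ** 2
    return math.sqrt(squared_error / n)


def fit_line(xs, ys):
    """Least-squares straight line, used here as a stand-in for PLS."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    slope = (sum((a - mx) * (b - my) for a, b in zip(xs, ys))
             / sum((a - mx) ** 2 for a in xs))
    return slope, my - slope * mx


def predict_line(model, x_new):
    slope, intercept = model
    return slope * x_new + intercept


x = [float(i) for i in range(10)]
y = [2.0 * xi + 1.0 + (0.1 if i % 2 else -0.1) for i, xi in enumerate(x)]
print(kfold_rmsecv(x, y, fit_line, predict_line, k=5))
```

Because RMSECV is computed only from calibration samples, a subset that minimizes it can still generalize less well to the validation set, which is the overfitting risk noted in the text.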
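2.2.3 Results of characteristic wavelength selection
CARS500 and CARS-GSA were applied in the same way to propionic acid and total acid, giving 101 propionic acid characteristic wavelengths and 245 total acid characteristic wavelengths. The distribution of the acetic acid, propionic acid and total acid characteristic wavelengths is shown in Fig. 7.
As Fig. 7 shows, all VFA characteristic wavelengths lie in the low and middle frequency region below 8 000 cm⁻¹. Most of them fall in the regions 4 000~4 933, 5 296~5 600 and 6 600~7 200 cm⁻¹, which are exactly the regions of the spectra with strong absorption peaks and good resolution. For propionic acid, 53 characteristic wavelengths lie in 4 100~4 500 cm⁻¹, corresponding to the combination bands of the -CH2 and -CH3 groups; 7 lie in 4 000~4 900 cm⁻¹, corresponding to the combination bands of the C=O and -OH groups; 2 lie in 5 300~5 320 cm⁻¹, corresponding to the first overtone of the -COOH group; 9 lie in 5 670~5 700 cm⁻¹, corresponding to the first overtone of the -CH2 group; 13 lie in 6 000~6 070 cm⁻¹, corresponding to the first overtone of the -CH3 group; and 17 lie in 6 860~7 060 cm⁻¹, corresponding to the second overtones of C=O, -CH2 and -OH. For total acid, 139 characteristic wavelengths lie in 4 000~4 720 cm⁻¹, corresponding to the combination bands of the C-C, C=O, -CH, -CH2 and -CH3 groups; 27 lie in 4 800~4 930 cm⁻¹, corresponding to the combination bands of the C=O and -OH groups; 19 lie in 5 300~5 380 cm⁻¹, corresponding to the first overtone of the -COOH group; 11 lie in 5 930~6 010 cm⁻¹, corresponding to the first overtones of the -CH, -CH2 and -CH3 groups; 2 lie in 6 590~6 600 cm⁻¹, corresponding to the second overtone of the C=O group; and 47 lie in 6 730~7 200 cm⁻¹, corresponding to the second overtones of C=O, -CH, -CH2, -CH3 and -OH. Analysis of the acetic acid, propionic acid and total acid characteristic wavelengths shows that the CARS-GSA and CARS500 results agree closely: CARS-GSA simply removes the weakly correlated wavelength points that were selected only a few times by CARS500.
Fig. 7 Results of VFA characteristic wavelength selection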
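To evaluate the two wavelength selection algorithms, the characteristic wavelength variables selected by CARS500 and CARS-GSA were used as inputs to PLS regression models for the quantitative determination of VFA in biogas slurry. The results were compared with full-spectrum modeling (Full-PLS) and with modeling on wavelengths selected by single-run CARS (best of 10 runs), as shown in Table 2.
Table 2 Evaluation indices of VFA PLS regression models
Note: PCs is short for principal components.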
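Table 2 shows that, among the VFA regression models built on wavelengths selected by single-run CARS, the acetic acid and propionic acid CARS models outperform full-spectrum modeling, whereas the total acid CARS model is weaker than full-spectrum modeling. The reason is that acetic acid and propionic acid have relatively simple structures, so CARS can quickly locate highly correlated characteristic wavelength points. Total acid is structurally more complex and has many characteristic wavelength points associated with different groups, so when CARS eliminates wavelength points it may discard some highly correlated ones, which harms modeling performance. Running CARS many times for wavelength selection solves the poor performance of single-run CARS on total acid.
CARS-GSA was adopted as the characteristic wavelength selection scheme for the acetic acid, propionic acid and total acid concentrations of the fermentation liquid during anaerobic fermentation. PLS regression models for the three concentrations were built on the selected wavelengths and evaluated, with the results shown in Fig. 8.
Fig. 8 Distribution of measured and predicted VFA values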
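The evaluation indices used to compare the models can be written down compactly. The definitions below are the common ones (R² as one minus the ratio of residual to total sum of squares, RPD as the standard deviation of the reference values divided by RMSEP) and may differ slightly from the exact formulas used for Table 2. The last function back-calculates the full-spectrum RMSEP implied by a reported percentage reduction, which is an inference from the abstract rather than a value read from the table.

```python
import math
import statistics


def rmse(y_true, y_pred):
    """Root mean squared error between reference and predicted values."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def r_squared(y_true, y_pred):
    """Coefficient of determination."""
    mean = statistics.fmean(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def rpd(y_true, y_pred):
    """Residual predictive deviation: SD of reference values over RMSE."""
    return statistics.stdev(y_true) / rmse(y_true, y_pred)


def implied_baseline(rmsep, reduction_percent):
    """RMSEP of the comparison model implied by a percentage reduction."""
    return rmsep / (1.0 - reduction_percent / 100.0)


# Reported CARS-GSA RMSEP values and reductions versus full-spectrum modeling
for name, value, cut in [("acetic", 0.111, 17.78),
                         ("propionic", 0.120, 15.49),
                         ("total", 0.727, 1.22)]:
    print(name, round(implied_baseline(value, cut), 3))
```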
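1) This study used NIRS combined with chemometric methods for the rapid detection of VFA in biogas slurry, with the CARS-GSA algorithm used for characteristic wavelength selection. The PLS regression models for acetic acid, propionic acid and total acid had coefficients of determination for validation of 0.988, 0.923 and 0.886 and root mean squared errors of prediction of 0.111, 0.120 and 0.727, respectively, and all RPD values exceeded 3. The models meet the requirements for rapid detection of acetic acid and propionic acid concentrations in the fermentation liquid during anaerobic fermentation of agricultural and livestock wastes, and basically meet the requirement for total acid concentration.
2) CARS500 performs well in selecting NIRS characteristic wavelengths for VFA concentration in fermentation liquid. Running CARS many times and taking the repeatedly selected wavelength points as characteristic wavelengths effectively improves modeling accuracy and efficiency, and to some extent resolves the randomness of the CARS selection results.
3) CARS-GSA uses GSA for secondary optimization of the wavelengths selected by repeated CARS runs and effectively removes the weakly correlated redundant wavelength points from the CARS500 result. While improving modeling accuracy and detection efficiency, it also establishes the correspondence between the relevant groups of acetic acid, propionic acid and total acid and their characteristic wavelengths, and the distribution of the characteristic wavelengths within the spectral range.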
[1]Shi Fengmei, Xu Hongtao, Lu Binyu, et al. Effects of temperature on production of volatile fatty acids in mesophilic anaerobic fermentation of swine wastewater and its cause analysis[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2018, 34(Supp.): 42-47. (in Chinese with English abstract)
[2]Yu Jiadong, Zhao Lixin, Feng Jing, et al. Physicochemical and percolating characteristics of sequencing batch dry anaerobic digestion of straw-cow manure mixture[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2019, 35(20): 228-234. (in Chinese with English abstract)
[3]Song Xiangyu, Zhang Keqiang, Fang Fang, et al. Influences of different technological strategies on performance of anaerobic co-digestion of pig manure with straw in solid-state[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2017, 33(11): 233-239. (in Chinese with English abstract)
[4]Sbarciog M, Giovannini G, Chamy R, et al. Control and estimation of anaerobic digestion processes using hydrogen and volatile fatty acids measurements[J]. Water Science and Technology, 2018, 78(10): 2027-2035.
[5]Chatterjee B, Radhakrishnan L, Mazumder D. New approach for determination of volatile fatty acid in anaerobic digester sample[J]. Environmental Engineering Science, 2018, 35(4): 333-351.
[6]Lahav O, Morgan B E. Titration methodologies for monitoring of anaerobic digestion in developing countries: A review[J]. Journal of Chemical Technology and Biotechnology, 2004, 79(12): 1331-1341.
[7]Hey T, Sandstrom D, Ibrahim V, et al. Evaluating 5 and 8 pH-point titrations for measuring VFA in full-scale primary sludge hydrolysate[J]. Water SA, 2013, 39(1): 17-22.
[8]Ward A J, Bruni E, Lykkegaard M K, et al. Real time monitoring of a biogas digester with gas chromatography, near-infrared spectroscopy, and membrane-inlet mass spectrometry[J]. Bioresource Technology, 2011, 102(5): 4098-4103.
[9]Peris M, Escuder-Gilabert L. On-line monitoring of food fermentation processes using electronic noses and electronic tongues: A review[J]. Anal Chim Acta, 2013, 804: 29-36.
[10]Yu Z, Leng X, Zhao S, et al. A review on the applications of microbial electrolysis cells in anaerobic digestion[J]. Bioresource Technology, 2018, 255: 340-348.
[11]Nguyen Duc, Gadhamshetty Venkataramana, Nitayavardhana Saoharit, et al. Automatic process control in anaerobic digestion technology: A critical review[J]. Bioresource Technology, 2015, 193: 513-522.
[12]Bruni E, Ward A J, Kocks M, et al. Comprehensive monitoring of a biogas process during pulse loads with ammonia[J]. Biomass & Bioenergy, 2013, 56: 211-220.
[13]Falk H M, Reichling P, Andersen C, et al. Online monitoring of concentration and dynamics of volatile fatty acids in anaerobic digestion processes with mid-infrared spectroscopy[J]. Bioprocess and Biosystems Engineering, 2015, 38(2): 237-249.
[14]Stockl Andrea, Lichti Fabian. Near-infrared spectroscopy (NIRS) for a real time monitoring of the biogas process[J]. Bioresource Technology, 2018, 247: 1249-1252.
[15]Nespeca Maurílio Gustavo, Rodrigues Caroline Varella, Santana Kamili Oliveira, et al. Determination of alcohols and volatile organic acids in anaerobic bioreactors for H2 production by near infrared spectroscopy[J]. International Journal of Hydrogen Energy, 2017, 42(32): 20480-20493.
[16]Li L, Peng X Y, Wang X M, et al. Anaerobic digestion of food waste: A review focusing on process stability[J]. Bioresource Technology, 2018, 248: 20-28.
[17]Rato Tiago J, Reis Marco S. Multiresolution interval partial least squares: A framework for waveband selection and resolution optimization[J]. Chemometrics and Intelligent Laboratory Systems, 2019, 186: 41-54.
[18]Zou Xiaobo, Zhang Junjun, Huang Xiaowei, et al. Distinguishing watermelon maturity based on acoustic characteristics and near infrared spectroscopy fusion technology[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2019, 35(9): 301-307. (in Chinese with English abstract)
[19]Zareef M, Chen Q S, Ouyang Q, et al. Prediction of amino acids, caffeine, theaflavins and water extract in black tea using FT-NIR spectroscopy coupled chemometrics algorithms[J]. Analytical Methods, 2018, 10(25): 3023-3031.
[20]Zhang Yakun, Luo Bin, Song Peng, et al. Rapid determination of soluble protein content for soybean leaves based on near infrared spectroscopy[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2018, 34(18): 187-193. (in Chinese with English abstract)
[21]Wang Qiaohua, Mei Lu, Ma Meihu, et al. Nondestructive testing and grading of preserved duck eggs based on machine vision and near-infrared spectroscopy[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2019, 35(24): 314-321. (in Chinese with English abstract)
[22]Jiang H, Xu W, Chen Q. Comparison of algorithms for wavelength variables selection from near-infrared (NIR) spectra for quantitative monitoring of yeast (Saccharomyces cerevisiae) cultivations[J]. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2019, 214: 366-371.
[23]Ye Dandan, Sun Laijun, Zou Borui, et al. Non-destructive prediction of protein content in wheat using NIRS[J]. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2018, 189: 463-472.
[24]Zhu Y W, Chen X Y, Wang S M, et al. Simultaneous measurement of contents of liquirtin and glycyrrhizic acid in liquorice based on near infrared spectroscopy[J]. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2018, 196: 209-214.
[25]Yang Wei, Li Minzan, Zheng Lihua, et al. Application of spectral screening method on prediction model of nitrogen content of jujube leaves[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2015, 31(Supp.2): 164-168. (in Chinese with English abstract)
[26]Zhu Yaodi, Zhang Jiaye, Li Miaoyun, et al. Effect of different peptidoglycan on clostridium perfringens spore germination and quantitative prediction[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2020, 36(4): 287-293. (in Chinese with English abstract)
[27]Yun Y H, Bin J, Liu D L, et al. A hybrid variable selection strategy based on continuous shrinkage of variable space in multivariate calibration[J]. Analytica Chimica Acta, 2019, 1058: 58-69.
[28]Yang M, Xu D, Chen S, et al. Evaluation of machine learning approaches to predict soil organic matter and ph using vis-NIR spectra[J]. Sensors, 2019, 19(2): 263.
[29]Song J, Li G, Yang X. Optimizing GA-PLS model of soluble solids content in Fukumoto navel orange based on Vis-NIR transmittance spectroscopy using discrete wavelet transform[J]. Journal of the Science of Food and Agriculture, 2019, 99(11): 4898-4903.
[30]Liu Jinming, Chu Xiaodong, Wang Zhi, et al. Optimization of characteristic wavelength variables of near infrared spectroscopy for detecting contents of cellulose and hemicellulose in corn stover[J]. Spectroscopy and Spectral Analysis, 2019, 39(3): 743-750. (in Chinese with English abstract)
[31]Liu Jinming, Cheng Qiushuang, Zhen Feng, et al. Rapid determination of C/N ratio for anaerobic fermentation feedstocks using near infrared spectroscopy based on GSA[J]. Transactions of the Chinese Society for Agricultural Machinery, 2019, 50(11): 323-330. (in Chinese with English abstract)
[32]Liu Jinming, Li Nan, Zhen Feng, et al. Rapid detection of carbon-nitrogen ratio for anaerobic fermentation feedstocks using near-infrared spectroscopy combined with BiPLS and GSA[J]. Applied Optics, 2019, 58(18): 5090-5097.
Rapid determination of volatile fatty acids in biogas slurry based on near infrared spectroscopy
Liu Jinming1,2,3, Guo Kunlin2, Zhen Feng1,3, Zhang Hongqiong1,4, Li Wenzhe1,4, Xu Yonghua5※
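(1. College of Engineering, Northeast Agricultural University, Harbin 150030, China; 2. College of Electrical and Information, Heilongjiang Bayi Agricultural University, Daqing 163319, China; 3. Key Laboratory of Renewable Energy, Chinese Academy of Sciences, Guangzhou 510640, China; 4. Heilongjiang Key Laboratory of Technology and Equipment for Utilization of Agricultural Renewable Resources in Cold Region, Harbin 150030, China; 5. College of Electrical and Information, Northeast Agricultural University, Harbin 150030, China)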
Volatile fatty acids (VFA), important intermediate products of anaerobic digestion (AD), are key variables in most AD monitoring strategies because they respond to incoming imbalances, indicating the buffer capacity of the digester against process disturbances and warning of imminent digester failure caused by sudden operational changes. To keep AD running efficiently while improving the utilization of raw materials, the operating state of a biogas plant must be monitored and evaluated accurately by detecting VFA concentrations during biogas production from corn stover and animal manure. Rapid detection models for acetic acid (AA), propionic acid (PA) and total acid (TA) in biogas slurry have been constructed using near infrared spectroscopy (NIRS) combined with partial least squares (PLS) to overcome the long analysis time and high cost of traditional chemical analysis. However, such prediction models can be complex and inaccurate because spectroscopic data generally contain a large amount of invalid, redundant information. In this study, an integrated algorithm based on competitive adaptive reweighted sampling (CARS) and the genetic simulated annealing algorithm (GSA) was presented to optimize the characteristic wavelength variables of AA, PA and TA and thereby improve the efficiency and precision of NIRS detection models. An AD experiment was carried out with corn stover, pig manure and cow manure as feedstocks, and 155 samples of biogas slurry were collected. The NIRS data of the biogas slurry were acquired in transmittance mode using an Antaris™ II FT-NIR spectrophotometer equipped with a quartz cuvette. A gas chromatography (GC) system was used to measure the VFA of the biogas slurry, yielding 81 valid values for AA, 78 for PA and 87 for TA with which to establish the regression models.
One segment of the spectrum, the 95 wavelength points from 4 933.02 to 5 295.57 cm⁻¹, was removed because the strong combination band of -OH from water saturates the spectrum there, leaving 1 462 wavelength variables. The spectral preprocessing methods were selected according to the mean relative error of the calibration set. The samples were then divided into a calibration set and a validation set using the sample set partitioning based on joint x-y distances (SPXY) algorithm. The numbers of characteristic wavelength variables for AA, PA and TA were 135, 101 and 245, respectively. PLS regression models were established with the characteristic wavelengths of AA, PA and TA. For AA, the coefficient of determination for prediction was 0.988, the root mean squared error of prediction (RMSEP) was 0.111 and the residual predictive deviation (RPD) was 9.685. For PA, the corresponding values were 0.923, 0.120 and 3.685, and for TA they were 0.886, 0.727 and 3.484. Compared with the whole-spectrum model, the RMSEP of the CARS-GSA model decreased by 17.78%, 15.49% and 1.22%, respectively, showing that the number of wavelengths decreased significantly after optimization while the performance of the regression model exceeded that of the whole-spectrum model. The results demonstrate that the CARS-GSA model can fulfil the requirement of rapid detection of AA and PA concentrations in biogas slurry during anaerobic fermentation with agricultural waste as feedstock, and basically meets the detection requirement for TA concentration. The CARS-GSA model can also enhance the forecasting capability of the model while reducing its complexity.
The findings provide a new way to improve the accuracy and robustness of prediction models by optimizing the sensitive wavelengths for AA, PA and TA, and support the rapid and accurate measurement of VFA concentrations in biogas slurry.
anaerobic digestion; volatile fatty acids; rapid determination; near infrared spectroscopy; partial least squares; genetic simulated annealing algorithm; competitive adaptive reweighted sampling
Liu Jinming, Guo Kunlin, Zhen Feng, et al. Rapid determination of volatile fatty acids in biogas slurry based on near infrared spectroscopy[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2020, 36(18): 188-196. (in Chinese with English abstract) doi:10.11975/j.issn.1002-6819.2020.18.023 http://www.tcsae.org
Received: 2020-05-10
Revised: 2020-06-28
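Key Laboratory of Renewable Energy, Chinese Academy of Sciences (Y907k81001); National Key Research and Development Program of China (2019YFD1100603); Heilongjiang Postdoctoral General Fund (LBH-Z19087); Heilongjiang Bayi Agricultural University "Three Horizontal and Three Vertical" Support Program (ZRCQC202007); Heilongjiang Bayi Agricultural University Research Start-up Program for Returning Scholars (XDB202006)
Liu Jinming, PhD, associate professor, mainly engaged in research on the application of spectral analysis technology in agriculture. Email: jinmingliu2008@126.com
Xu Yonghua, associate professor, mainly engaged in research on spectral analysis technology. Email: xyhsy@126.com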
10.11975/j.issn.1002-6819.2020.18.023
O657.33
A
1002-6819(2020)-18-0188-09