A general framework for predictive tensor modeling with domain knowledge

Zhu, Yada; He, Jingrui; Lawrence, Richard D.

doi:10.1007/s10618-014-0392-8

A general framework for predictive tensor modeling with domain knowledge

Published: 12 November 2014

Volume 29, pages 1709–1732, (2015)
Cite this article

Data Mining and Knowledge Discovery Aims and scope Submit manuscript

Yada Zhu¹,
Jingrui He² &
Richard D. Lawrence¹

513 Accesses
Explore all metrics

Abstract

In many real applications such as virtual metrology in semiconductor manufacturing, face recognition, and gait recognition in computer vision, the input data is naturally expressed as tensors or multi-dimensional arrays. Furthermore, in addition to the known label information, domain knowledge can often be obtained from various sources, e.g., multiple domain experts. To address such problems, in this paper, we propose a general optimization framework for dealing with tensor inputs while taking into consideration domain knowledge. To be specific, our framework is based on a linear model, and we obtain the weight tensor in a hierarchical way—first approximate it by a low-rank tensor, and then estimate the low-rank approximation using the domain knowledge from various sources. This is motivated by wafer quality prediction in semiconductor manufacturing. We also propose an effective algorithm named H-MOTE for solving this framework, which is guaranteed to converge. For each iteration, the time complexity of H-MOTE is linear with respect to the number of examples as well as the size of the weight tensor. Therefore, H-MOTE is scalable to large-scale problems. Experimental results show that H-MOTE outperforms state-of-the-art techniques on both synthetic and real data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Artificial Intelligence

Notes

Note that the vectorized version of the tensor may not be unique due to different orderings of the fibers.
The results of the the other two methods are similar as in Fig. 1.
Currently we are trying to address the challenge of different duration for different steps, so that the input tensors contain instantaneous measurements, which may improve the performance of quality prediction.
We omit the name of the variable due to confidential issues.
http://cs.nyu.edu/~roweis/data.html.

References

Bishop CM (2006) Pattern recognition and machine learning (Information Science and Statistics). Springer, Secaucus
Google Scholar
Cai D, He X, Han J (2006) Learning with tensor representation. Technical Report. University of Illinois at Urbana, Champaign
Google Scholar
Chang Y-J, Kang Y, Hsu C-L, Chang C-T, Chan TY (2006) Virtual metrology technique for semiconductor manufacturing. In: IJCNN
Dai G, Yeung D-Y (2006) Tensor embedding methods. In: AAAI
Grippo L, Sciandrone M (2000) On the convergence of the block nonlinear Gauss–Seidel method under convex constraints. Oper Res Lett 26(3):127–136
Article MathSciNet MATH Google Scholar
He X, Cai D, Niyogi P (2005) Tensor subspace analysis. MIT Press, In: NIPS
Hochreiter S, Obermayer K (2004) Classification, regression, and feature selection on matrix data. Technical Report. Technische Universitat, Berlin
Google Scholar
Kang P, Kim D, Lee H-J, Doh S, Cho S (2011) Virtual metrology for run-to-run control in semiconductor manufacturing. Expert Syst Appl 38:2508–2522
Article Google Scholar
Khan AA, Moyne JR, tilbury DM (2008) Virtual metrology and feedback control for semiconductor manufacturing processes using recursive partial least squares. J Process Control 18:961–974
Article Google Scholar
Kolda TG, Bader BW (2009) Tensor decompositions and applications. SIAM Rev 51(3):455–500
Article MathSciNet MATH Google Scholar
Li X, Lin S, Yan S, Xu D (2008) Discriminant locally linear embedding with high-order tensor data. IEEE Trans Syst Man Cybern 38(2):342–352
Article Google Scholar
Lynn S, Ringwood J, Ragnoli E, McLoone S, MacGearailt N (2009) Virtual metrology for plasma etch using tool variables. In: Advanced Semiconductor Manufacturing Conference
Mitchell TM (1997) Machine Learning. McGraw-Hill, New York
MATH Google Scholar
Platt JC (1999) Advances in kernel methods. MIT Press, Cambridge, MA.
Roweis ST, Saul LK (2000) Nonlinear dimensionality reduction by locally linear embedding. SCIENCE 290:2323–2326
Article Google Scholar
Su Y-C, Lin T-H, Cheng F-T, Wu W-M (2008) Accuracy and real-time considerations for implementing various virtual metrology algorithms. IEEE Trans Semicond Manuf 21(3):426–434
Article Google Scholar
Sun J, Tao D, Papadimitriou S, Yu PS, Faloutsos C (2008) Incremental tensor analysis: Theory and applications. TKDD 2(3):11
Article Google Scholar
Tao D, Li X, Hu W, Maybank S, Wu X (2005) Supervised tensor learning. In: ICDM, IEEE Computer Society, pp. 450–457
Tao D, Li X, Maybank SJ, Wu X (2006) Human carrying status in visual surveillance. In: CVPR, pp. 1670–1677
Tao D, Li X, Wu X, Maybank SJ (2007) General tensor discriminant analysis and gabor features for gait recognition. IEEE Trans Pattern Anal Mach Intell 29(10):1700–1715
Article Google Scholar
Tao D, Sun J, Shen J, Wu X, Li X, Maybank SJ, Faloutsos C (2008) Bayesian tensor analysis. In: IJCNN, pp. 1402–1409
Vapnik VN (2000) The nature of statistical learning theory. Springer, New York
Book MATH Google Scholar
Wang Q, Chen F, Xu W (2011) Tracking by third-order tensor representation. IEEE Trans Syst Man Cybern 41:385–396
Article Google Scholar
Wasserman L (2009) All of statistics. Springer, New York
Google Scholar
Zhu Y, He J, Lawrence RD (2012) Hierarchical modeling with tensor inputs. In: AAAI Conference on Artificial Intelligence, pp. 1233–1239

Download references

Author information

Authors and Affiliations

IBM T.J. Watson Research Center, Yorktown Heights, USA
Yada Zhu & Richard D. Lawrence
CIDSE, Arizona State University, Tempe, AZ, USA
Jingrui He

Authors

Yada Zhu
View author publications
You can also search for this author inPubMed Google Scholar
Jingrui He
View author publications
You can also search for this author inPubMed Google Scholar
Richard D. Lawrence
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Yada Zhu.

Additional information

Responsible editor: Chih-Jen Lin.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhu, Y., He, J. & Lawrence, R.D. A general framework for predictive tensor modeling with domain knowledge. Data Min Knowl Disc 29, 1709–1732 (2015). https://doi.org/10.1007/s10618-014-0392-8

Download citation

Received: 12 November 2012
Accepted: 24 October 2014
Published: 12 November 2014
Issue Date: November 2015
DOI: https://doi.org/10.1007/s10618-014-0392-8

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A general framework for predictive tensor modeling with domain knowledge

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Riemannian conjugate gradient method for low-rank tensor completion

Nonnegative low rank tensor approximations with multidimensional image applications

Optimality conditions for Tucker low-rank tensor optimization

Notes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

A general framework for predictive tensor modeling with domain knowledge

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Riemannian conjugate gradient method for low-rank tensor completion

Nonnegative low rank tensor approximations with multidimensional image applications

Optimality conditions for Tucker low-rank tensor optimization

Explore related subjects

Notes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now