فراتفکیک‌پذیری مبتنی بر نمونه تک‌تصویر متن با روش نزول گرادیان ناهمزمان ترتیبی

محورهای موضوعی : مهندسی برق و کامپیوتر

1 - دانشگاه تربیت مدرس
2 - دانشگاه تربیت مدرس

تاریخ دریافت : 1396/04/23 تاریخ پذیرش : 1396/04/23 تاریخ انتشار : 1395/10/01

کلید واژه: بهسازی تصویر متن افزایش تفکیک‌پذیری فراتفکیک‌پذیری بیزی فراتفکیک‌پذیری مبتنی بر نمونه الگوریتم نزول گرادیان,

چکیده مقاله :

در اين مقاله، یک روش جدید برای افزایش تفکیک‌پذیری تک‌تصویری تصاویر متن ارائه می‌شود. این روش مبتنی بر نمونه است یعنی برای فراتفکیک‌پذیری از یک مجموعه نمونه آموزشی که شامل وصله‌های با تفکیک‌پذیری بالا و پایین است استفاده می‌شود. بر اساس قاعده بیزی، یک تابع به عنوان درست‌نمایی و سه تابع به عنوان دانش اولیه در نظر گرفته می‌شوند. تابع مربوط به درست‌نمایی میزان شباهت با تصویر اولیه را توصیف می‌کند. سه تابع مربوط به دانش اولیه خصوصیات دومُدی بودن تصویر متن، یکنواخت‌بودن نواحی پس‌زمینه و متن و نزدیک‌بودن به مجموعه نمونه آموزشی را توصیف می‌کنند. با کمینه‌کردن این توابع انرژی طی فرایند تکرارشونده نزول گرادیان ناهم‌زمان ترتیبی، تصویر با تفکیک‌پذیری بالا به دست می‌آید. به جای کمینه‌کردن هم‌زمان ترکیب خطی توابع، آنها به ترتیب و با توجه به این که در تکرارهای متوالی الگوریتم چه تغییراتی در تصویر متن رخ می‌دهد کمینه می‌گردند. به این ترتیب دیگر نیازی به تعیین ضرایب ترکیب خطی توابع که برای تصاویر مختلف متغیر هستند نخواهد بود. نتایج آزمایش‌ها روی بیست تصویر متن با قلم‌ها، تفکیک‌پذیری‌ها، تارشدگی‌ها و نویزهای مختلف عملکرد بهتر و با حجم محاسباتی کمتر روش ارائه‌شده نسبت به روش‌های مشابه قبلی را نشان می‌دهد.

چکیده انگلیسی:

In this paper, a new method for resolution enhancement of single document images is presented. The proposed method is example based using an example set of low-resolution and high-resolution training patches. According to the Bayes rule, one function is considered as the likelihood or data-fidelity term that measures the fidelity of the output high-resolution to the input low-resolution image. As well, three other functions are considered as the regularization terms containing the prior knowledge about the desired high-resolution document image. Three priors which are fulfilled by the regularization terms are bimodality of document images, smoothness of background and text regions, and similarity to the patches in the example set. By minimizing these four energy functions through the iterative procedure of asynchronous sequential gradient descent, the HR image is reconstructed. Instead of synchronous minimization of the linear combination of these functions, they are minimized in order and according to the gradual changes in their values and in the updating HR image. Therefore, determining the coefficients of the linear combination, which are variable for input images, is no longer required. In the experimental results on twenty document images with different fonts, at different resolutions, and with different amounts of noise and blurriness, the proposed method achieves significant improvements in visual image quality and in reducing the computational complexity.

منابع و مأخذ:

[1] A. Abedi and E. Kabir, "Stroke width-based directional total variation regularisation for document image super resolution," IET Image Processing, vol. 10, no. 2, pp. 158-166, Feb. 2016.
[2] P. Milanfar, Super-Resolution Imaging, vol. 1, CRC Press, 2010.
[3] K. Donaldson and G. Myers, "Bayesian super-resolution of text in video with a text-specific bimodal prior," Int. J. Document Anal. Recognit., vol. 7, no. 2, pp. 159-167, Jul. 2005.
[4] C. M. Thillou and M. Mirmehdi, "An introduction to super-resolution text," Digital Document Processing, Advances in Pattern Recognition, vol. 16, no. 17, pp. 305-327, Sep. 2007.
[5] D. Datsenko and M. Elad, "Example-based single document image super-resolution: a global MAP approach with outlier rejection," Multidim Syst Sign Process, vol. 18, no. 2, pp. 103-121, Sep. 2007.
[6] M. Elad and D. Datsenko, "Example-based regularization deployed to super-resolution reconstruction of a single image," The Computer J., vol. 52, no. 1, pp. 15-30, Oct. 2009.
[7] J. Park, Y. Kwon, and J. H. Kim, "An example-based prior model for text image super-resolution," in Proc. 8th Int. Conf. on Document Analysis and Recognition, vol. 1, pp. 374-378, Sep. 2005.
[8] R. Zeyde, M. Elad, and M. Protter, "On single image scale-up using sparse-representations," Curves and Surfaces Lecture Notes in Computer Science, vol. 6920, pp. 711-730, Jan. 2012.
[9] R. Walha, F. Driram, F. Lebourgeois, and A. M. Alimi, "Super-resolution of single text image by sparse representation," in Proc. of the Workshop on Document Analysis and Recognition DAR'12, pp. 22-29, Aug. 2012.
[10] G. Caner and I. Haritaoglu, "ShapeDNA: effective character restoration and enhancement for arabic text documents," in Proc. 20th Int. Conf. on Pattern Recognition, ICPR'10, pp. 2053-2056, Jul. 2010.
[11] S. Baker and T. Kanade, "Limits on super-resolution and how to break them," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 24, no. 9, pp. 1167-1183, Nov. 2002.
[12] W. T. Freeman, E. C. Pasztor, and O. T. Carmichael, "Learning low-level vision," International J. of Computer Vision, vol. 40, no. 1, pp. 25-47, Oct. 2000.
[13] P. Thouin and C. Chang, "A method for restoration of low-resolution document images," Int. J. Document Anal. Recognit., vol. 2, no. 4, pp. 200-210, Jun. 2000.
[14] H. Q. Luong and W. Philips, "Robust reconstruction of low-resolution document images by exploiting repetitive character behaviour," International J. on Document Analysis and Recognition, vol. 11, no. 1, pp. 39-51, Oct. 2008.
[15] J. Banerjee and C. V. Jawahar, "Super-resolution of text images using edge-directed tangent field," in Proc. 8th IAPR Int. Workshop on Document Analysis Systems, DAS'08, pp. 76-83, Nov. 2008.
[16] A. Kheradmand and P. Milanfar, "A general framework for regularized, similarity-based image restoration," IEEE Trans. on Image Processing, vol. 23, no. 12, pp. 5136-5151, Dec. 2014.
[17] J. H. Friedman, J. L. Bentley, and R. A. Finkel, "An algorithm for finding best matches in logarithmic expected time," ACM Trans. on Mathematical Software, vol. 3, no. 3, pp. 209-226, Feb. 1977.
[18] M. V. W. Zibetti, F. S. V. Bazan, and J. Mayer, "Determining the regularization parameters for super-resolution problems," Signal Processing, vol. 88, no. 12, pp. 2890-2901, Dec. 2008.
[19] A. Panagiotopoulou and V. Anastassopoulos, "Super-resolution image reconstruction techniques: trade-offs between the data-fidelity and regularization terms," Information Fusion, vol. 13, no. 3, pp. 185-195, Jul. 2012.
[20] A. Agarwal and J. C. Duchi, Distributed Delayed Stochastic Optimization, arXiv: 1104.5525, 2011.
[21] H. Lu, A. Kot, and Y. Shi, "Distance-reciprocal distortion measure for binary document images," IEEE Signal Processing Letters, vol. 11, no. 2, pp. 228-231, Feb. 2004.
[22] S. M. Pincus, I. M. Gladstone, and R. A. Ehrenkranz, "A regularity statistic for medical data analysis," J. of Clinical Monitoring and Computing, vol. 7, no. 4, pp. 335-345, Feb. 1991.
[23] G. Louloudis, B. Gatos, I. Pratikakis, and C. Halatsis, "Text line detection in handwritten documents," Pattern Recognition, vol. 41, no. 12, pp. 3758-3772, Dec. 2008.
[24] L. Zheng, S. Wang, and Y. Liu, "Information theoretic regularization for semi-supervised boosting," in Proc. of the 15th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, KDD'09, pp. 1017-1026, Aug. 2009.
[25] L. M. Lorigo and V. Govindaraju, "Offline Arabic handwriting recognition: a survey," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 28, no. 5, pp. 712-724, Mar. 2006.

مقالات مرتبط

یک رهیافت فرااکتشافی چندهدفه برای بهبود پوشش و اتصال در شبکه‌های حسگر بی‌سیم
تاریخ چاپ : 1405/02/22
رویکرد ارزیابی هیجان نوین جهت مراقبت از سرطان مبتنی بر مدل‌های زبانی بزرگ
تاریخ چاپ : 1405/02/22
ارائه روشی برای مدیریت منابع در شبکه‌های Fog-DSDN با بهره‌گیری از معماری میکروسرویس و شبکه‌های ESN
تاریخ چاپ : 1405/02/22
چارچوب ترکیبی سبک‌وزن برای امنیت اینترنت اشیا با استفاده از جنگل تصادفی بهینه و انتخاب ویژگی تطبیقی در معماری لبه-ابری
تاریخ چاپ : 1405/02/22
یک چارچوب یادگیری نیمه‌نظارتی جهت دسته‌بندی دقیق موارد آزمون با بهره‌گیری از تعبیه‌های زبانی و ویژگی‌های معنایی متن
تاریخ چاپ : 1405/02/22
تکنیک هوشمند مبتنی بر الگوریتم چتر دریایی برای زمان‌بندی وظایف بر اساس اولویت در شبکه‌های IoT/Fog
تاریخ چاپ : 1405/02/22

اشتراک گذاری

آدرس مقاله

فراتفکیک‌پذیری مبتنی بر نمونه تک‌تصویر متن با روش نزول گرادیان ناهمزمان ترتیبی