THE FORMS INPUT PROJECT


CASE STUDY: ABBYY FORMREADER
THE FORMS INPUT PROJECT IN THE STATE TAX SERVICE OF RUSSIA

Can you imagine a country whose citizens do not know what a tax return is and have never seen a tax officer? The Russian Federation before 1996 was such a country. At that time Russia was taking its first steps toward a market economy, but a civilized country with a market economy cannot exist without an effective tax service. So the parliament passed a law on the taxation of natural persons in [...]. According to this law, every citizen must submit tax returns and income reports (the so-called W-2 forms) to the State Tax Service, and the law was to come into force on January 1, [...].

In 1998 the overall volume of tax return and income report forms amounted to [...] documents from citizens of the Moscow region alone. As there are 7 pages on average per tax return form, the amount of paper is [...] pages (30 fully loaded five-ton trucks are required to transport this pile of paper!). The data from these forms had to be entered into the information databases of the State Tax Service (now the Ministry of Taxes and Duties).

It went without saying that it was impossible to enter such an amount of data manually in time, because that would require hundreds of operators doing the unskilled work of retyping forms for many months, as had been done before with balance accounts. They would have been typing in forms instead of collecting taxes from solvent citizens and thus guaranteeing the salaries of physicians, teachers, scientists and others. But an automatic solution was already on its way. By that time, ABBYY had already developed a number of data input projects for local offices of the State Tax Service. So in 1997 the State Tax Service contracted ABBYY to find a solution for this data input task.
When one develops such projects in Russia, one usually encounters many difficulties, both technical and administrative, that make development drag on for ages. But thanks to the extraordinary efforts of the Information Resources Department of the State Tax Service of Russia and ABBYY Software House, the first automatic form data input system was developed and installed in less than 6 months.

The project details

The developers' primary concern was, naturally, data input, but the scale of the project was much larger, because nothing of the kind had been done before in the State Tax Service. Many organizational issues had to be taken into account, from where and how to print the forms, through the procedures for distributing them among the local tax inspectorates, to how and where to keep the form archives. So ABBYY's task was not confined to developing the data input system; ABBYY was to develop the whole technology of data collection and input. And ABBYY performed most successfully. A single example: ABBYY designed a set of special folders and bags for transporting tax return forms from the local tax offices to the data input centers, in order to prevent damage to the forms. Many other components of the technology were also designed, such as information banners telling people how to fill in the forms, and a system of signs in the tax offices telling people where to get blank forms and where to leave completed ones.

One of the main features that made this project a difficult and unique task was its revolutionary character. No similar project had ever been run in Russia; more than that, no other country in the world had ever tried to develop a project of this kind. Moreover, the project was so serious that not a single mistake in form input could be tolerated. So it was decided that a pilot system should be developed and tested in Moscow, and that only after the test results had been analyzed should the system be rolled out across the whole of Russia. And the work began.

The first step was to develop the tax return form itself, as no document of this kind had ever been used in Russia. The new automatic document processing system required machine-readable forms, which were new to our country. The only institution that used such forms was the Pension Fund of Russia (a semi-governmental organization), and in any case the tax return forms were the first multi-page machine-readable forms.

Another issue was that Russian law requires a great number of official approvals from various governmental bodies before a form of an official document may be used, and final approval of the whole tax return form would obviously have taken much time. But the deadline was too close, so we decided to seek approval only for the content of the future tax return form, while decisions on the overall appearance and structure of the form were left to the regional tax service offices. So the content of the machine-readable tax return form was approved by the Ministry of Taxes and Duties, and ABBYY was entrusted with creating a sample form layout.

ABBYY specialists did everything possible to develop a layout that would suit the task best. First, red was chosen as the background color. All inscriptions were in red too, a little darker than the background. Using this color makes scanning easier and more reliable. Second, each page was given its own unique barcode and unique letter. These features let the system choose the required form template automatically; automatic template matching is crucial for the automatic recognition of multi-page forms. So the tax return form was ready for use.

Then some additional problems emerged. As already mentioned, the forms contained several pages, so there was the problem of keeping together the different pages of a single person's form. There were some interesting suggestions, for example to assign each citizen a unique identification number, but unfortunately there was no time to do this, so the only way was to transport the forms as carefully as possible to avoid mixing pages. That was another reason for developing special folders and bags for transporting forms.

Another difficulty was the taxpayers themselves. For the most part, they were seeing a tax return form for the first time in their lives, and they tended to make mistakes in filling it in because they were unable to follow the instructions properly; it was all too unusual for them. So the tax officers had to verify all incoming information to prevent mistakes in the tax return data.
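
The role of the per-page barcode and letter in choosing a form template can be illustrated with a short Python sketch. This is not FormReader code: the barcode values, template names and the select_template function are hypothetical, chosen only to show how a page might be routed to the right template and how the printed letter could serve as a second check on the barcode.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Template:
    name: str
    page_letter: str


# Hypothetical registry: barcode value printed on a page -> template for that page.
TEMPLATES = {
    "0001": Template("tax_return_page_a", "A"),
    "0002": Template("tax_return_page_b", "B"),
    "0003": Template("tax_return_page_c", "C"),
}


def select_template(barcode, page_letter):
    """Return the template for a scanned page, or None if it needs manual routing."""
    template = TEMPLATES.get(barcode)
    if template is None:
        # Unknown or misread barcode: no template can be chosen automatically.
        return None
    if template.page_letter != page_letter:
        # Barcode and letter disagree, so one of them was probably misread.
        return None
    return template


page = select_template("0002", "B")
print(page.name if page else "manual routing")
```

Returning None instead of guessing keeps doubtful pages out of the automatic path, which matches the project's requirement that input mistakes not be tolerated.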

Technology of centralized tax return form processing

This technology was developed with the help of ABBYY experts. All information about the income of each person is kept in a unified regional database. The following channels are used to put information into the database:

- Manual data input from paper documents
- Automatic data input from scanning stations
- Data input from electronic media

Information is archived periodically and kept in special archives from which it can easily be retrieved at any moment.

First ten thousand pages

In January 1998 the first tax return forms were collected in the tax offices. By that time the FormReader system was already installed on 33 workstations in Moscow Tax Office 39. The system could process more than [...] forms per day. According to the law, all citizens were to report their income by the end of April, so a huge river of documents began flooding the tax offices in spring. And the offices had only 120 days after May 1 to process all of them! Such time limits made it impossible to enter all the forms manually. But the FormReader system eliminated this problem for at least one tax office in Moscow.

Igor Popov, Chief of the Data Processing Center of the Tax Ministry of Russia.

"The advantages of the system developed together with ABBYY are evident considering the season-dependent conditions of our work," says Igor Popov, former Chief of Tax Office 39, now Chief of the Moscow Data Processing Center of the Ministry of Taxes and Duties of Russia. "If we tried to input the documents manually in 120 days, we would have to employ more than 1000 professional typists, buy the same number of computers, and rent about 6000 m² of workspace. And what would we do with all these people and this workspace during the remaining 9 months of the year?"

Results

The pilot project was declared a success, and the technology of automatic data input spread all over Russia. It was planned to install automatic form processing systems in the 5 biggest regions of Russia in [...]. Moscow Tax Office 39 became the Moscow Data Processing Center of the Ministry of Taxes and Duties of Russia. As of now, 3 automatic data input systems are installed there, each consisting of more than 30 workstations, and the tax return forms from almost every Moscow tax office are brought there for processing.

The scanning capacity of the system is [...] pages per minute on average. The scanner used is the BancTec S-185S. One operator processes 3-6 pages per minute on average, and verification speed grows as the operator becomes more experienced. Further development and modification of the system by ABBYY's specialists each year increases the recognition quality and makes the verification process faster and more convenient.

The effectiveness and cost efficiency of automatic data input technology are now evident to everybody. The input quality achieved by FormReader is much higher than that of manual input.

Figure 1. Pilot project scheme installed in the Moscow Tax Office 39 in [...]

Even without verification, FormReader makes about 5 mistakes per 1000 handprinted characters * and only 1 mistake per 1000 printed ** characters. That is 4 times fewer mistakes than a professional typist makes in the morning and almost 6 times fewer than the same typist makes in the evening. Besides, the application automatically checks the input information using reference files and databases, compares the sum in figures with the sum in words, and so on, thus achieving 100% reliable recognition results. And it goes without saying that FormReader is faster than any typist, even a highly professional one.

Verification is the only stage at which the throughput of the whole system is affected by human productivity. That is why ABBYY's experts paid special attention to this stage. We provided a whole set of tools and techniques that can be used separately or in combination to organize efficient verification. By changing template settings and adding rules it is possible to fine-tune the verification process to minimize human effort and solve specific tasks. The three-step verification technology implemented in FormReader (group, context, and in-form verification), the customizable error display level and checking level, and other options provide the flexibility that will help you build the most efficient processing technology. Thanks to all these tools, an operator needs to check only the characters that were recognized uncertainly. This makes it possible to process [...] pages per day instead of [...] pages per day with manual input.

As of now, the 8 largest regions of Russia make effective use of the enterprise version of the FormReader system. High form processing speed, excellent recognition quality, automatic control of recognition results and low cost make FormReader the most efficient automatic data capture system. One of the most important results of this project is the increase in the total sum of taxes collected. FormReader made the State Tax Service of Russia one of the first tax services in the world to use an automatic bulk input system for tax return forms.

* for neatly written characters without corrections
** for documents with good printing quality
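
The automatic check that compares a sum in figures with the same sum in words can be sketched in Python as follows. This is only an illustration under stated assumptions: it parses English number words, whereas the real forms were in Russian, and the names words_to_int and needs_verification are hypothetical and not part of FormReader.

```python
# Assumption: amounts are written in English words; a real system would need
# a parser for the language printed on the form.
UNITS = {
    "zero": 0, "one": 1, "two": 2, "three": 3, "four": 4, "five": 5,
    "six": 6, "seven": 7, "eight": 8, "nine": 9, "ten": 10,
    "eleven": 11, "twelve": 12, "thirteen": 13, "fourteen": 14,
    "fifteen": 15, "sixteen": 16, "seventeen": 17, "eighteen": 18,
    "nineteen": 19, "twenty": 20, "thirty": 30, "forty": 40,
    "fifty": 50, "sixty": 60, "seventy": 70, "eighty": 80, "ninety": 90,
}
SCALES = {"thousand": 1_000, "million": 1_000_000}


def words_to_int(text):
    """Parse English number words; return None if any word is not recognized."""
    total = 0
    current = 0
    for word in text.lower().replace("-", " ").split():
        if word == "and":
            continue
        if word in UNITS:
            current += UNITS[word]
        elif word == "hundred":
            current *= 100
        elif word in SCALES:
            total += current * SCALES[word]
            current = 0
        else:
            return None
    return total + current


def needs_verification(amount_digits, amount_words):
    """Return True if the two recognized fields disagree and an operator should look."""
    if not amount_digits.isdecimal():
        # A stray letter in the digits field (e.g. "S" read instead of "5").
        return True
    return words_to_int(amount_words) != int(amount_digits)


print(needs_verification("12345", "twelve thousand three hundred forty-five"))
print(needs_verification("12845", "twelve thousand three hundred forty-five"))
```

Because the two fields are recognized independently, a disagreement points to a recognition or filling error in one of them, so only such forms need to reach a human operator.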
