Lots of files, including images and books, have been uploaded here under the GODL-India template. The current trend seems to be that editors assume the existence of GODL if the item is sourced from a Govt website and the file matches, in the uploader's opinion, the criteria suitable for GODL. But can a user site like Wikimedia Commons assume the existence of GODL when the Govt source site did not expressly license the item under GODL?

Let's take a closer look at the rules.

1. Paragraph 3 of the GODL notification specifies that the license is for data sets published under NDSAP and through the OGD Platform. This means that the data needs to be published on the https://data.gov.in website, which is the OGD platform, for this license to be applicable.

2. Section 2(b) of the GODL notification expressly provides for a license by the Govt data provider: "Data Provider(s)" means person(s) publishing and providing the data under this license.

3. As per Section 6(b), this license is not applicable for Data that the data provider(s) is not authorized to license, that is data that is non-shareable and/or sensitive.

Sl. Nos. 2 and 3 rule out any assumption of this license by the user site when it is not expressly given by the data provider; there is no supersession clause under which the nature of the data (i.e., that it fits in with Section 3) can override this requirement of a license from the data provider.

4. Section 5 provides that the attribution must specify "Published under [Name of License]". So, how are we assuming GODL for files which were not published under this license?

5. Section 12(h) of the NDSAP notification provides that data and metadata under this license will be uploaded to the data.gov.in website. So, how are we assuming this license for files from other sites?

6. Files under this license are available in the data.gov.in catalog (https://data.gov.in/catalogs); where is the documentation for applicability of this license for files not present in those catalogs?

7. Some states have published their data policies; their data are also uploaded to their respective subdomains of the data.gov.in site. As per Section 12(a) of the Odisha State Data Policy 2015: All State Government Departments will store their datasets at State Data Centre as a backup storage. This is the state's subdomain: https://odisha.data.gov.in.

8. NDSAP does apply to all data generated with public funds, as per Section 5 of the NDSAP. But those data must first be classified into Open Access, Registered Access and Restricted Access as per Section 8, taking into consideration the various laws of the country as per Section 10. This classification is done by the respective departments under the overall monitoring of the national oversight committee, as per Section 7. Data of state governments are also classified in this way, under the monitoring of state-level steering committees (see, for example, Section 12(k) of the Odisha state data policy: The SDSC will be the final authority to decide the classification of data into open, registered and restricted categories, apart from declaration of any non-sharable (negative) data.)

This is an elaborate exercise, and only after it is completed is the GODL license assigned to data in the Open Access category.

Therefore, we need to decide whether Wikimedia Commons is a competent entity to assign the GODL license to files not expressly published under this license, or whether we should use it only for files published under it.

This is required when missing pages are added to an existing index and pages in the Page: namespace have already been created. Admin privilege is required for using the script. The script is meant for use on Linux; the process on Windows 7 is described below.

Go to GitHub (https://github.com/tshrinivasan/tools-for-wiki) and download tools-for-wiki-master.zip from the green coloured clone or download button, then extract it. Delete all folders except move-pages-bn-wikisource from the extracted folder. Now move the tools-for-wiki-master folder to C:\Windows\System32.

Right-click on the config.ini file and choose Edit with Notepad++. Change the settings like this:

wiki_username = Your user name

wiki_password = Your password

wikisource_language = en or as appropriate.

Also change the book settings:

book_name = Page:bookname.pdf (or djvu)

start_page_number = --

end_page_number = --

increment_order = --

Along with that, remove the GitHub copyright line in the footer. (Don't edit with regular Notepad; that will insert a Byte Order Mark at the start of the file.)
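If config.ini was saved with regular Notepad by mistake, the Byte Order Mark can be checked for and removed afterwards. The following is only a minimal sketch in Python, not part of the tools-for-wiki script; the function names (has_utf8_bom, strip_utf8_bom, clean_file) are made up for illustration, and it assumes the file was saved as UTF-8.

```python
import codecs
from pathlib import Path

# The three bytes that Notepad may prepend when saving as UTF-8.
UTF8_BOM = codecs.BOM_UTF8  # b'\xef\xbb\xbf'


def has_utf8_bom(data: bytes) -> bool:
    """Return True if the raw bytes start with a UTF-8 Byte Order Mark."""
    return data.startswith(UTF8_BOM)


def strip_utf8_bom(data: bytes) -> bytes:
    """Return the bytes without a leading UTF-8 BOM (unchanged if none)."""
    return data[len(UTF8_BOM):] if has_utf8_bom(data) else data


def clean_file(path: str) -> bool:
    """Remove a leading BOM from the file in place.

    Returns True if a BOM was found and removed, False otherwise.
    (Hypothetical helper; the path is whatever file you edited.)
    """
    p = Path(path)
    raw = p.read_bytes()
    if not has_utf8_bom(raw):
        return False
    p.write_bytes(strip_utf8_bom(raw))
    return True
```

For example, calling clean_file("config.ini") from the folder that holds the file would rewrite it without the mark, and leave it untouched if no mark is present.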

Now, the actual work. It is in TIF format. You need a plug-in (see here for Windows) to see the pages. With it you can view the work page by page and save the pages as TIF images. This is of course very time-consuming. How to download the whole work as a PDF? Download DLI Downloader 0.24 Lite from this site and you are set. Just enter the barcode and download the book. If the downloader says that the scan does not exist (this may happen because of the presence or absence of "new" or "new1" at the start of the web address), then go to the Collection option and toggle between IISc and IUCAA. If your connection is slow, use safe mode. Experts can use more complicated tools found here.