This page in a nutshell: Summarize in your own words instead of closely paraphrasing. Closely paraphrased material that infringes on the copyright of its source material should be rewritten or deleted to avoid infringement, and to ensure that it complies with Wikipedia policy. Public domain sources and CC-BY-SA-compatible sources may be closely paraphrased, and limited close paraphrasing of copyrighted sources may also be permitted as fair use. Attribution is always required.

Close paraphrasing is the superficial modification of material from another source. Editors should generally summarize source material in their own words, adding inline citations as required by the sourcing policy.

Limited close paraphrasing is appropriate within reason, as is quoting, so long as the material is clearly attributed in the text – for example, by adding "John Smith wrote ...", together with a footnote containing the citation at the end of the clause, sentence or paragraph. Limited close paraphrasing is also appropriate if there are only a limited number of ways to say the same thing.

Close paraphrasing without in-text attribution may constitute plagiarism, and when extensive (with or without in-text attribution) may also violate Wikipedia's copyright policy, which forbids Wikipedia contributors from copying material directly from other sources. Public domain material must likewise be attributed to avoid plagiarism. If the source material bears a free copyright license that is compatible with Wikipedia's licenses, copying or closely paraphrasing it is not a copyright violation so long as the source is attributed somewhere in the article, usually at the end.

The best way to prevent close paraphrasing is to understand clearly when it is a problem, how to avoid it, and how to address it when it appears.

Wikipedia's primary concern is with the legal constraints imposed by copyright law. Close paraphrasing of the creative expression in a non-free copyrighted source is likely to be an infringement of the copyright of the source. In many countries close paraphrasing may be also seen as mutilation or distortion of an author's work, infringing on their moral rights.

The Llama is a woolly sort of fleecy hairy goat, with an indolent expression and an undulating throat; like an unsuccessful literary man.

If this somewhat dubious source was used for the article on llamas and was still protected by copyright, it would be acceptable to say that the llama is an animal with a shaggy coat, and perhaps that it has a long neck. These are facts. But use of the phrases "indolent expression" and "undulating throat" might violate copyright. The original choice of words is part of Belloc's creative expression. Going further, the simile "like an unsuccessful literary man" is also creative, and is also protected. A clumsy paraphrase like "resembling a failed writer" might violate copyright even though the words are entirely different. More than the facts have been copied.

It is of course also necessary that other requirements for copyright violation also exist, such as being a "substantial" taking.

Translation from a foreign language is a form of paraphrase, since all the words or phrases have been replaced with equivalent English-language words or phrases. This may or may not be acceptable, depending on whether any creative expression – anything other than simple statements of fact – has been taken from the foreign language source. For example, consider two translations from the Turkish language:

"Istanbul is a large city"

"The sun looms through the haze like a red omen"

The first is a simple statement of fact and should be acceptable. The second carries over the figurative expressions "looms through" and "like a red omen", so presumably is not acceptable despite using completely different words from the original. But even if you only carry across statements of fact, the more you translate and the more closely you translate, the more likely you are to create a copyright problem.

Although facts are not subject to copyright, a selection or arrangement of facts may be considered creative and therefore protected. For example, an alphabetical list of states in the US giving their name, size and population cannot be copyrighted. However, a shorter list of states giving the name, size and population as before, but ranked as the "top most livable states" would be subject to copyright. The selection and ranking is creative.[1][2]

Wikipedia does not have an official policy regarding moral rights of authors.

The "moral rights" of an author are independent of copyright ownership.
They include the author's right to control first publication of a work; the author's right to be attributed or to remain anonymous; the author's right for the work to be published without distortion or mutilation. As with copyright, moral rights apply to creative expression but not to mere facts. Respecting moral rights can help ensure that Wikipedia content can be reused as widely as possible.

In accordance with verifiability policy, Wikipedia editors should not use unpublished work (note: unpublished work in public collections may be suitable). With published work, editors should attribute each source to the author where the publication names the author, and attribute the source to the publication if it does not name the author. It is sometimes relevant for an article to include a short quotation such as a significant statement made by the subject of the article or a notable comment about the subject. In these cases a verbatim quotation should be given rather than a paraphrase. Quotations should be used very sparingly[vague] and should be clearly identified and formatted as defined in MOS:QUOTE.

The US Copyright Office states that, "Copyright law does not protect names, titles, or short phrases or expressions... The Copyright Office cannot register claims to exclusive rights in brief combinations of words ... To be protected by copyright, a work must contain a certain minimum amount of authorship ... Names, titles, and other short phrases do not meet these requirements."[3] However, if a source creatively combines, selects or arranges names, titles, short phrases or expressions, following it too closely may infringe on its copyright.

Paraphrasing rises to the level of copyright infringement when there is substantial similarity between an article and a copyrighted source. This may exist when the creative expression in an important passage of the source has been closely paraphrased, even if it is a small portion of the source, or when paraphrasing is looser but covers a larger part of the source or covers "the heart" (the most essential content). A close paraphrase of one sentence from a book may be of low concern, while a close paraphrase of one paragraph of a two-paragraph article might be considered a serious violation. Editors must therefore take particular care when writing an article, or a section of an article, that has much the same scope as a single source. The editor must be especially careful in these cases to extract the facts alone and present the facts in plain language, without carrying forward anything that could be considered "creative expression".

Under US copyright law, however, substantial similarity does not always indicate infringement. It does not indicate infringement, for instance, where the doctrine of fair use permits the use of the material.[4] Wikipedia deliberately adopts a narrower limitation and exception from copyright than fair use. Our policy and guideline are set out at Wikipedia:Non-free content.

Substantial similarity is also immaterial when strong evidence exists that the content was created independently.[5] An author may think they are being original when they write "Charles de Gaulle was a towering statesman", not realizing that many other authors have independently come up with these identical words. What looks like copying or close paraphrasing may thus be accidental. These similarities are more likely to exist where content is less creative and more formulaic. Independent creation is less likely when there is evidence that the source was consulted or close following is extensive.

Even when content is verifiably public domain or released under a compatible free license, close paraphrasing may be at odds with Wikipedia's guideline related to plagiarism (see Wikipedia:Plagiarism). While in this context, too, close paraphrasing of a single sentence is not as much of a concern, if a contributor closely paraphrases public domain or freely licensed content, he or she should explicitly acknowledge that content is closely paraphrased. (See below.)

Another potential problem arises when a contributor copies or closely paraphrases a biased source either purposefully or without understanding the bias. This can make the article appear to directly espouse the bias of the source, which violates our neutral point of view policy.

There are a few specific situations when close paraphrasing is permitted. If information is gathered from the public domain or is free use content, close paraphrasing may be acceptable. In some instances it is helpful to capture the words as written, in which case the guidelines for Quotations apply. Lastly, there may be some instances where it's difficult to paraphrase because of the nature of the content; in such cases, there are a couple of tips below about how to limit the degree of close paraphrasing to avoid issues.

When using a close paraphrase legitimately, citing a source is in most cases required and highly recommended.[6]

In some limited cases, close paraphrasing may be an acceptable way of writing an article. For example, many Wikipedia articles are (or were) based on text from the 1911 Encyclopaedia Britannica (see Wikipedia:1911 Encyclopaedia Britannica). If the source is in the public domain, such as work of the U.S. government, or is available under a license compatible with the CC-BY-SA license (a partial table of license compatibility can be found at the Copyright FAQ), then the source may be closely paraphrased if the source is appropriately attributed. Attribution in such instances may include in-text attribution that makes clear whose words or ideas are being used (e.g. "John Smith wrote that ...") or may include more general attribution that indicates the material originates from a free source, either as part of an inline citation or as a general notice in the article's "References" section (for further information on how to attribute free sources, see Wikipedia:Plagiarism#Copying material from free sources).

Limited quotation from non-free copyrighted sources is allowed, as discussed in Wikipedia's non-free content policy and guideline. Quotations should have in-text attribution and should be cited to their original source or author (see WP:When to cite). With direct quotation, editors should clearly distinguish the quoted material from the original text of the article following the guidelines for quotations. Extensive use of quotation from non-free sources is generally not acceptable. Even if content is attributed, it can still create copyright problems if the taking is too substantial. To avoid this risk, Wikipedia keeps this—like other non-free content—minimal.

Quotation from non-free sources may be appropriate when the exact words in the source are relevant to the article, not just the facts or ideas given by the source. Examples may include statements made by a person discussed in the article; brief excerpts from a poem, song, or book described in the article; or significant opinions about the subject of the article. Quotation should not, however, be treated as an alternative to extracting facts and presenting them in plain language. Thus:

Close paraphrasing is also permitted when there are only a limited number of ways to say the same thing. This may be the case when there is no reasonable way to avoid using technical terms, and may also be the case with simple statements of fact.

Names or titles of people, organizations, books, films and so on may be given in full – there is no creative expression in a name or title, which is often the only way to identify the subject. Short catchphrases, slogans or mottos may also be reproduced where relevant to the discussion. It is acceptable to use a technical term such as "The War of the Spanish Succession" or "Relational Database Management System (RDBMS)" when the term is almost always used by sources that discuss the subject, and when such sources rarely use any other term. In this case, the technical term is considered to be "merged" with the idea expressed. There is no reasonable alternative way of expressing the idea, and since ideas are not subject to copyright the term is also not protected. However, if different sources use different terms for the concept, it may be best for the article to use a different term from the source or to include the term in a sourced quote.

An example of closely paraphrased simple statements of fact is given by a biography that relies on two sources for the basic outline. The sources and the article start with:

Source1: John Smith was born in Hartford, Connecticut on February 2nd 1949... He attended State University, obtaining an M.D. in 1973.

Source2: John Smith was born on 2 February 1949 in Hartford... He graduated with a medical degree from State University in 1973.

Article: John Smith was born on 2 February 1949 in Hartford, Connecticut... He studied medicine at State University, and earned an M.D. in 1973.

In this example, the wording of the article is very close to that of both sources. However, the article merely presents standard facts for a topic like this in standard sequence. The article does not copy any creative words or phrases, similes or metaphors, and makes an effort at paraphrasing in the second sentence. Just two short sentences are close to the sources. For these reasons the close paraphrasing should be acceptable.
Note, however, that closely paraphrasing extensively from a non-free source may be a copyright problem, even if it is difficult to find different means of expression. The more extensively we rely on this exception, the more likely we are to run afoul of compilation protection.

"A statement from the receiver, David Carson of Deloitte, confirmed that 480 of the 670 employees have been made redundant ... At least 100 Waterford Crystal employees are refusing to leave the visitors' gallery at the factory tonight and are staging an unofficial sit-in. The employees say they will not be leaving until they meet with Mr. Carson. There were some scuffles at one point and a main door to the visitors' centre was damaged ... Local Sinn Féin Councillor Joe Kelly, who is one of those currently occupying the visitors' gallery, said the receiver had told staff he would not close the company while there were interested investors."

"A statement issued by the receiver, Deloitte's David Carson, confirmed that, of the 670 employees, 480 of them would be laid off. The workers responded angrily to this unexpected decision and at least 100 of them began an unofficial sit-in in the visitors' gallery at the factory that night. They insisted they would refuse to leave until they had met with Carson. Following the revelations, there was a minor scuffle during which the main door to the visitors' centre was damaged. Local Sinn Féin Councillor Joe Kelly was amongst those who occupied the visitors' gallery."

Analysis:

"A statement issued by the receiver, Deloitte's David Carson, confirmed that, of the 670 employees, 480 of them would be laid off" vs. "A statement from the receiver, David Carson of Deloitte, confirmed that 480 of the 670 employees have been made redundant".

The structure of Wikipedia's statement is essentially the same as the original. Changing a single word and slightly reordering one phrase is not enough to constitute a paraphrase.

"They insisted they would refuse to leave until they had met with Carson" vs. "The employees say they will not be leaving until they meet with Mr. Carson".

The structure of this sentence is the same as the original with too much similarity within the structure of the paragraph.

"there was a minor scuffle during which the main door to the visitors' centre was damaged" vs. "There were some scuffles at one point and a main door to the visitors' centre was damaged".

The structure and language of the two sentences are the same.

"Local Sinn Féin Councillor Joe Kelly was amongst those who occupied the visitors' gallery" vs. "Local Sinn Féin Councillor Joe Kelly, who is one of those currently occupying the visitors' gallery".

This slight rewording does not change the fact that the underlying structure and language are the same. Minor changes, such as "was amongst those" --> "is one of those" and "occupied" --> "currently occupying", are not enough to constitute an original rewriting of the passage.

See below for an example of an unusable paraphrase repaired to become acceptable.

To properly paraphrase content, you review information from reliable sources, extract the salient points, and use your own words, style and sentence structure to draft text for an article.[9][10]

Take notes

One of the key factors in the creation of inadvertent close paraphrasing is starting with text taken directly from the source. The word choice and style can easily resurface since it is foremost in our minds. An approach to ensure that information is fully understood and formed into unique thoughts is to isolate the essential information by taking notes.

Start off by taking notes of essential information only, excluding the use of phrases.

Collect information from multiple sources. This will have several benefits: it promotes a tone with a neutral point of view and produces a well-rounded understanding of the topic. It also makes it less likely that your end result will follow too closely on any one source.

Gather short quotations when they powerfully illustrate a point for your article. Overuse can result in a disjointed article and may breach copyright. (Extensive quotations are forbidden by policy.)

Write a draft

Gather related items from the multiple sources and explain it to yourself: The point is to rephrase or summarize a body of information in your own words and sentence structure.

Words or ideas do not follow the same pattern and order as the source material.

The example above on this page illustrates a common way in which people closely paraphrase content; this one demonstrates how to properly synthesize and paraphrase information.

Example: close paraphrasing repaired

Consider the following example of a close paraphrase (unacceptable version) and ways to correct it to make an acceptable version:

Example

Wording

Comments

Source

Born in Slavery: Slave Narratives from the Federal Writers' Project, 1936–1938 contains more than 2,300 first-person accounts of slavery and 500 black-and-white photographs of former slaves. These narratives were collected in the 1930s as part of the Federal Writers' Project of the Works Progress Administration (WPA) and assembled and microfilmed in 1941 as the seventeen-volume Slave Narratives: A Folk History of Slavery in the United States from Interviews with Former Slaves. This online collection is a joint presentation of the Manuscript and Prints and Photographs Divisions of the Library of Congress and includes more than 200 photographs from the Prints and Photographs Division that are now made available to the public for the first time. Born in Slavery was made possible by a major gift from the Citigroup Foundation. Source: Born in Slavery, Library of Congress

Unacceptable version

A collection of more than 2,300 accounts of slavery taken directly from former slaves and 500 black-and-white photographs make up the Born in Slavery: Slave Narratives from the Federal Writers' Project, 1936–1938. In the 1930s these narratives were compiled in the 1930s as part of the Federal Writers' Project of the Works Progress Administration (WPA). They were aggregated and microfilmed in 1941 as the seventeen-volume Slave Narratives: A Folk History of Slavery in the United States from Interviews with Former Slaves. The collection is a united effort of the Manuscript and Prints and Photographs Divisions of the Library of Congress. Citigroup Foundation made the Born in Slavery possible by a major gift.

The structure of this version is essentially the same as the original. Changing a few words and slightly reordering phrases is not enough to constitute a paraphrase.

Correcting issues – step 1) Bring in other source(s).

The Slave Narrative Collection provides a unique and virtually unsurpassed collective portrait of a historical population. Indeed, historian David Brion Davis has argued that the voluminous number of documented slave testimonies available in the United States "is indisputably unique among former slaveholding nations." In addition to the substantial number of life histories it contains, the most compelling feature of the collection is the composition of the sample of people who made up its informants. Although not a representative sample of the slave population, they were a remarkably diverse and inclusive cross-section of former slaves. Those whose voices are included in the collection ranged in age from one to fifty at the time of emancipation in 1865, which meant that more than two-thirds were over eighty when they were interviewed. Source: A Collective Portrait

Bringing in other sources helps to ensure that there's a good understanding of the topic and a neutral point of view.

In the 1930s a Works Progress Administration (WPA) project, called Federal Writers' Project, was conducted to capture the history record of people born into slavery. At the time of the project 2/3rds of the more than 2,300 men and women interviewed were over the age of eighty, having been one to fifty years of age when they obtained their freedom in 1865. Over 500 black and white photographs were taken of interview subjects. The Library of Congress Manuscript and Print and Photograph Divisions assembled a seventeen-volume collection from the set, called "Born in Slavery: Slave Narratives from the Federal Writers' Project, 1936–1938." David Brion Davis, a historian, claimed that the large collection "is indisputably unique among former slaveholding nations."

This version brings together information from two sources, without maintaining the previous sentence structure or style.

Unlike straightforward copyright violations, close paraphrasing is notoriously difficult to detect; frequently the contributor will add wiki syntax and write in the style of a Wikipedia article (as indeed they should). Here are some ways you might detect it:

Look for disjointed and sudden changes in the tone, vocabulary, and style of content introduced by the same contributor. For example, "The cat is a small predatory carnivorous species of crepuscular mammal. Housecats like to kill mice and bats."

Look for redundant content; this may be a sign that two or more sources were closely paraphrased. For example, "The cat is a small predatory carnivorous species of crepuscular mammal. Like many pets, domestic cats are carnivorous."

Look for content that resembles content included in a quotation.

Examine the talk pages of major contributors and other pages where they have written in their own words, and determine if their article contributions substantially differ in tone, structure, and vocabulary from these discussions.

Take short phrases from the article and put them in a search engine. Take a look at the results and see if they closely resemble the article.

Your approach may vary depending upon the severity of the concern. Here are a couple of ways to manage close-paraphrase concerns.

Flag the problem

Insert a dispute template and/or engage in a copyright-infringement discussion: Your approach here may depend upon the extensiveness of the issues you discover.

You can use the {{Close paraphrasing}} template, which can be customized to identify the source and to indicate if the source is public domain, to mark it for cleanup (and usually one would also open a talk page discussion about the matter):

This article or section contains close paraphrasing of one or more non-free copyrighted sources. Ideas in this article should be expressed in an original manner. See the talk page for details.

However, if you believe that the close paraphrasing in question is so close that it infringes copyright, instead follow the instructions at Template:Copyvio, which may require removing the paraphrasing content until it can be repaired. Unless close paraphrasing is immediately obvious, it is good practice to cite specific passages alongside the corresponding passage from the source on the talk page to highlight their similarity; this will provide objective evidence of close paraphrasing.

Notice to the contributor

It is important to discuss your concerns with the contributor. Many people who paraphrase too closely are not intentionally infringing, but just don't know how to properly paraphrase. It might help to point them to this essay or to the references and resources listed here, which include some pointers for proper paraphrasing.

Note: All text in these examples is dedicated by its authors to the public domain

The following example messages can be copied and pasted directly from this page, although you will need to fill in your own example close paraphrases as well as supplying the article's title and the source URL. The messages strive to avoid accusations while at the same time pointing to clear instructions on how to fix errors of this sort. The spaces for examples from the editor's inappropriate text are provided because even experienced or good faith editors may not recognize where the issues lie without them. If there is a passage of several consecutive sentences which is a continuous close paraphrase, this may alone be a sufficient demonstration. Otherwise, showing the pattern in several separated sentences is typically better than offering one, brief example.

The following example was engineered for cases when the paraphrasing is close enough to require blanking of the article and listing at Wikipedia:Copyright problems. It would not be appropriate for situations where the {{close paraphrasing}} template is used, since rewriting can be done on the spot rather than in a temporary page. You may use this example verbatim, if you wish, but may and should modify it if it is not completely appropriate to the circumstances.

Hi. I'm afraid the [[ArticleName]] article you wrote may be a problem under our copyright policies, since the text seems very [[Wikipedia:Close paraphrasing|closely paraphrased]] from [source]. While facts are not copyrightable, creative elements of presentation – including both structure and language – are. For an example of close paraphrasing, consider the following:

:example from source
The article says:
:example from article
There are other passages that similarly follow quite closely.

As a website that is widely read and reused, Wikipedia takes copyright very seriously to protect the interests of the holders of copyright as well as those of the Wikimedia Foundation and our reusers. Wikipedia's [[Wikipedia:Copyrights|copyright policies]] require that the content we take from non-free sources, aside from brief and clearly marked quotations, be rewritten from scratch.

The article has been replaced with a notice of these copyright concerns that includes directions for resolving them. If the material can be verified to be [[Wikipedia:Compatible license|compatibly licensed]] or [[Wikipedia:Public domain|public domain]] or if [[Wikipedia:Copyright problems#Copyright owners who submitted their own work to Wikipedia (or people editing on their behalf)|permission is provided]], we can use the original text with proper attribution. If you can resolve it that way, please let me know if you need assistance with those directions. Otherwise, so that we can be sure it does not constitute a derivative work, this article should be rewritten; there is a link to a temporary space for that purpose in the instructions which now appear in place of the article. The essay [[Wikipedia:Close paraphrasing]] contains some suggestions for rewriting that may help avoid these issues. The article [[Wikipedia:Wikipedia Signpost/2009-04-13/Dispatches]], while about plagiarism rather than copyright concerns, also contains some suggestions for reusing material from sources that may be helpful, beginning under "Avoiding plagiarism".

This example was engineered for cases when the paraphrasing is not enough of a concern to require blanking and listing and the {{close paraphrasing}} template is used instead. You may use this example verbatim, if you wish, but may and should modify it if it is not completely appropriate to the circumstances.

Hi. I'm afraid the [[ArticleName]] article you contributed to has parts which are very [[WP:close paraphrasing|closely paraphrased]] from [source]. This can be a problem under both our [[Wikipedia:Copyrights|copyright policy]] and our [[Wikipedia:Plagiarism|guideline on plagiarism]].

While facts are not copyrightable, creative elements of presentation – including both structure and language – are. For an example of close paraphrasing, consider the following:
The source says:
:example from source
The article says:
:example from article
This is an example; there are other passages that similarly follow quite closely.

As a website that is widely read and reused, Wikipedia takes copyright very seriously to protect the interests of the holders of copyright as well as those of the Wikimedia Foundation and our reusers. Wikipedia's [[Wikipedia:Copyrights|copyright policies]] require that the content we take from non-free sources, aside from brief and clearly marked quotations, be rewritten from scratch. So that we can be sure it does not constitute a [[derivative work]], this article should be revised to separate it further from its source. The essay [[Wikipedia:Close paraphrasing]] contains some suggestions for rewriting that may help avoid these issues. The article [[Wikipedia:Wikipedia Signpost/2009-04-13/Dispatches]] also contains some suggestions for reusing material from sources that may be helpful, beginning under "Avoiding plagiarism".

Substantial similarity (article) – Substantial similarity is the standard developed and used by United States courts to determine whether a defendant has infringed the reproduction right of a copyright.

Scènes à faire (article) – Sometimes there are only so many ways to say something.

^In Feist Publications v. Rural Telephone Service, the United States Supreme Court noted that factual compilations of information may be protected with respect to "selection and arrangement, so long as they are made independently by the compiler and entail a minimal degree of creativity," as "[t]he compilation author typically chooses which facts to include, in what order to place them, and how to arrange the collected data so that they may be used effectively by readers"; the Court also indicated that "originality is not a stringent standard; it does not require that facts be presented in an innovative or surprising way" and that "[t]he vast majority of works make the grade quite easily, as they possess some creative spark, 'no matter how crude, humble or obvious' it might be."("Decision". Feist Publications, Inc., v. Rural Telephone Service Co., 499 U.S. 340 (1991).)

^But also "thick protection is the norm, but when there are very few articulable, concrete similarities based on protected aspects of a work and a limited number of ways in which the underlying ideas could be expressed differently, or where the only protectable aspect of a work is the 'unique selection and arrangement' of otherwise unprotectable elements, a work is entitled only to 'thin' protection, where 'virtually identical copying' is required to support a finding of infringement." [https://www.documentcloud.org/documents/2447394-jerseyboys.html Robert Jones, in the Jerseyboys case.

^An exception would be when closely paraphrasing a compatibly licensed source that is not permissible as a citation. For instance, one may closely paraphrase another Wikipedia article or use content from another compatibly licensed user-generated wiki, so long as the content meets core content policies. Attribution may be required, as explained at Wikipedia:Plagiarism#Copying material from free sources, but citing it is a source might be against policy.