If this is your first visit, be sure to
check out the FAQ by clicking the
link above. You may have to register
before you can post: click the register link above to proceed. To start viewing messages,
select the forum that you want to visit from the selection below.

Re: 100 million word British National Corpus.

Interesting thoughts...keep it going...TDOL, when you say you get your students to do (?) concordance checks on Google, how is it done? I am very new to this whole concordance data business but I like what I see...very interesting...

Re: 100 million word British National Corpus.

Originally Posted by tdol

It has many limitations, but it is a very good way to get people started, and is good for frequency, geography, etc. If people look carefully at the results, they might be able to amalyse register themselves- if it's all fromm business texts, say, then that gives clues. It doesn't offer the clever searches, though.

www.onelook.com is another handy tool with wildcards for basic things like prefixes and suffixes.

Which limitations have you identified and is there a better free source?

<If people look carefully at the results, they might be able to amalyse register themselves- >

Even NNES?

<It doesn't offer the clever searches, though. >

For example?

<www.onelook.com is another handy tool with wildcards for basic things like prefixes and suffixes. [/>

Re: 100 million word British National Corpus.

Originally Posted by Jing-Jo

Interesting thoughts...keep it going...TDOL, when you say you get your students to do (?) concordance checks on Google, how is it done? I am very new to this whole concordance data business but I like what I see...very interesting...

Cheers

Firstly, it's amazing how many people don't know how to use the " " in a search for a complete phrase. Armed with this, a student can do a simple comparison. Let's say we want to compare complete and utter with the word disgrace:

We do a search for "complete disgrace", using the inverted commas: 7,220, then we do "utter disgrace": 13,400. It is clear that both are correct, but 'utter' is more common. Now, let's try it with something else:

Re: 100 million word British National Corpus.

Originally Posted by tdol

Firstly, it's amazing how many people don't know how to use the " " in a search for a complete phrase. Armed with this, a student can do a simple comparison. Let's say we want to compare complete and utter with the word disgrace:

We do a search for "complete disgrace", using the inverted commas: 7,220, then we do "utter disgrace": 13,400. It is clear that both are correct, but 'utter' is more common. Now, let's try it with something else:

This kind of use of Google as a language tool is easy and useful, but it has limitations. I think it's a good start.

<This kind of use of Google as a language tool is easy and useful, but it has limitations. I think it's a good start.>

Yes, I agree, a good start. For more advanced students though, I think a true concordancer is a necessary tool. Also, Google searches both native and nonnative examples of English, so cannot always be relied on to provide correct usage.
-----------------

I can't find any examples of "complete disgrace" in the BNC.

<We do a search for "complete disgrace", using the inverted commas: 7,220, then we do "utter disgrace": 13,400. It is clear that both are correct, but 'utter' is more common. >

But ones searches are only as good as the words one can think of as collocates.

E.G.

The BNC gives "absolute" as the main adjective collocate for the noun "disgrace" (adj + noun collocation). On BNC, you get to that by using the tag [aj*] followed by a space and then the noun. How do I do that on Google?

We see that the strongest collcation is actually "absolute disgrace" in both Google and the BNC. That isn't so surprising as "absolute" is a more forceful adjective than "complete", and I suppose "utter", and it collocates well with an equally forceful noun such as "disgrace".

Taking all that back to the BNC, I can now check if there are differences between which collocation pair appears in a certain register, regarding frequency.

Spoken register:

ABSOLUTE DISGRACE 9
UTTER DISGRACE 2
COMPLETE DISGRACE 0

Fiction register:

UTTER DISGRACE 1
ABSOLUTE DISGRACE 1
COMPLETE DISGRACE 0

News Register:

ABSOLUTE DISGRACE 5
UTTER DISGRACE 5
COMPLETE DISGRACE 0

So it appears that UTTER DISGRACE is less popular in spoken English, but equally popular in fiction and the news.

Interesting, innit? Interesting also that "absolute" (adj) is far more common that "utter" (adj) in the BNC.

N.B. To me, "complete disgrace" doesn't feel like BE. It may be, but I don't use it.

But then again, from a search on Google.co.uk (pages from the UK only), I get this:

Re: 100 million word British National Corpus.

Originally Posted by M56

So you can see why I have a preference for the BNC.

The BNC obviously has tools that target language, which give it a clear edge over Google; I am only proposing Googling as a way of getting people into the basic idea. However, one thing in Google's favour is that it has the widest and most inclusive database of any tool.

Re: 100 million word British National Corpus.

Originally Posted by tdol

The BNC obviously has tools that target language, which give it a clear edge over Google; I am only proposing Googling as a way of getting people into the basic idea. However, one thing in Google's favour is that it has the widest and most inclusive database of any tool.

<However, one thing in Google's favour is that it has the widest and most inclusive database of any tool. >