GSA: index e-commerce sites and display the results sorted by price?

We want to index a few public e-commerce sites. When our customers search for a product, the results from all indexed e-commerce sites should be displayed sorted by price.

From my understanding, the public e-commerce sites use different meta tags for pricing, so I cannot consolidate them into one meta tag. Is it possible to feed the data via XML? I don't have much idea how to achieve that, and we don't have DB access to parse only the required data. Alternatively, how can I use entity recognition to index the price as a meta tag?

Could you please advise us whether this is achievable? If yes, which is the best solution, and which documentation should we refer to?

Best How To :

I'll ignore the sorting issue and concentrate on the problem of normalising the price metadata. You need a way to read the price from whatever metadata field it's in and create a new metadata field with a common name and the same value.

There are a few ways to do this but the simplest are probably:

Generate a metadata-and-url feed for each document and add the normalised metadata to it

Crawl via a proxy that adds an X-GSA-External-Metadata header containing the normalised metadata
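As a rough illustration of the first option, here is a minimal Python sketch that builds a metadata-and-url feed with one normalised price field per page. Everything specific in it is an assumption rather than part of the answer above: the field name price, the datasource name ecommerce_prices, the example URLs, and the idea that the raw price strings have already been scraped from each site's own tag. The price parser also assumes commas are thousands separators, which will not hold for every locale. Check the element layout against the GSA feeds documentation for your appliance version before pushing anything.

```python
import re
import xml.etree.ElementTree as ET
from decimal import Decimal

# Prolog the GSA feed format is generally documented as expecting.
FEED_PROLOG = (
    '<?xml version="1.0" encoding="UTF-8"?>\n'
    '<!DOCTYPE gsafeed PUBLIC "-//Google//DTD GSA Feeds//EN" "">\n'
)


def normalise_price(raw):
    """Pull the first number out of a raw price string, e.g. '$1,299.00' -> '1299.00'.

    Assumption: commas are thousands separators, dots are decimal points.
    """
    match = re.search(r"\d[\d,]*(?:\.\d+)?", raw)
    if match is None:
        raise ValueError("no numeric price found in %r" % raw)
    value = Decimal(match.group(0).replace(",", ""))
    return str(value.quantize(Decimal("0.01")))


def build_feed(pages, datasource="ecommerce_prices"):
    """Build a metadata-and-url feed from (url, raw_price) pairs.

    'datasource' and the meta name 'price' are hypothetical names for this sketch.
    """
    root = ET.Element("gsafeed")
    header = ET.SubElement(root, "header")
    ET.SubElement(header, "datasource").text = datasource
    ET.SubElement(header, "feedtype").text = "metadata-and-url"
    group = ET.SubElement(root, "group")
    for url, raw_price in pages:
        record = ET.SubElement(group, "record", url=url, mimetype="text/html")
        metadata = ET.SubElement(record, "metadata")
        # Every site ends up with the same field name, whatever its own tag was called.
        ET.SubElement(metadata, "meta", name="price", content=normalise_price(raw_price))
    return FEED_PROLOG + ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    print(build_feed([
        ("http://shop-a.example/item1", "$1,299.00"),
        ("http://shop-b.example/p/42", "USD 15"),
    ]))
```

The resulting XML would then be pushed to the appliance's feed endpoint. The scraping step that finds each site's raw price is deliberately left out, since it depends on the markup of each site.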