SEO with robots.txt
Make sure a robots.txt file exists on your server. If it does not, you will need to create one; if it does, you may need to modify it. What is robots.txt, exactly? I will not attempt a formal definition. Instead, I want to show why this file matters for optimizing a site, and not only for that. The robots.txt file tells search engine spiders which pages they may crawl, and therefore which pages can end up indexed.
Googlebot and Robots.txt
When deciding which pages to crawl, Googlebot proceeds in this order:
1. Googlebot obeys the first record in robots.txt whose User-agent starts with "Googlebot".
2. If no "Googlebot" User-agent exists, it obeys the first record that declares User-agent "*".
Googlebot follows only one of these records, so rules listed under "*" are not applied to it once a "Googlebot" record exists.
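The selection order above can be checked with a small sketch. It uses Python's standard `urllib.robotparser`; the two records, the bot name `OtherBot`, and the example.com URLs are made up for illustration. The standard-library parser only approximates Googlebot (it does not understand wildcards), so treat this as a rough check rather than a faithful simulation.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt with a generic record and a Googlebot record.
ROBOTS_TXT = """\
User-agent: *
Disallow: /stats/

User-agent: Googlebot
Disallow: /private/
"""

def build_parser(text):
    """Parse robots.txt content from a string instead of fetching it."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser

parser = build_parser(ROBOTS_TXT)

# Googlebot follows only its own record, so /stats/ is not blocked for it.
googlebot_stats = parser.can_fetch("Googlebot", "http://example.com/stats/page.html")
googlebot_private = parser.can_fetch("Googlebot", "http://example.com/private/page.html")

# Any other crawler falls back to the "*" record.
other_stats = parser.can_fetch("OtherBot", "http://example.com/stats/page.html")
other_private = parser.can_fetch("OtherBot", "http://example.com/private/page.html")

print(googlebot_stats, googlebot_private, other_stats, other_private)
```

If a directory should be closed to every crawler, it therefore has to be repeated inside the Googlebot record as well.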
Robots.txt for WordPress
User-agent: *
# do not allow crawling of any files in the following directories
Disallow: /cgi-bin/
Disallow: /z/j/
Disallow: /z/c/
Disallow: /stats/
Disallow: /dh_
Disallow: /about/
Disallow: /contact/
Disallow: /tag/
Disallow: /wp-admin/
Disallow: /wp-includes/
Disallow: /contact
Disallow: /manual
Disallow: /manual/*
Disallow: /phpmanual/
Disallow: /category/
User-agent: Googlebot
# do not allow crawling of files that end with these extensions
Disallow: /*.php$
Disallow: /*.js$
Disallow: /*.inc$
Disallow: /*.css$
Disallow: /*.gz$
Disallow: /*.wmv$
Disallow: /*.cgi$
Disallow: /*.xhtml$
# do not crawl URLs that contain a ?
Disallow: /*?*
# disable duggmirror
User-agent: duggmirror
Disallow: /
# allow the Google image bot to crawl all images
User-agent: Googlebot-Image
Allow: /*
# allow the AdSense bot across the whole site
User-agent: Mediapartners-Google*
Allow: /*
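The Googlebot record above relies on two pattern characters: `*` stands for any run of characters and a trailing `$` marks the end of the URL. The following is a minimal sketch of that matching, assuming the wildcard semantics Google documents for its own crawlers; the function names `rule_to_regex` and `rule_matches` are hypothetical helpers, not part of any library.

```python
import re

def rule_to_regex(rule_path):
    """Translate a robots.txt path pattern into a compiled regular expression.

    '*' matches any run of characters; a trailing '$' anchors the end of the URL.
    Without '$', the pattern only has to match a prefix of the URL path.
    """
    anchored = rule_path.endswith("$")
    if anchored:
        rule_path = rule_path[:-1]
    # Escape the literal pieces and join them with ".*" where a '*' stood.
    body = ".*".join(re.escape(part) for part in rule_path.split("*"))
    return re.compile(body + ("\\Z" if anchored else ""))

def rule_matches(rule_path, url_path):
    """Return True when url_path is covered by the robots.txt pattern."""
    return rule_to_regex(rule_path).match(url_path) is not None

# "/*.php$" covers URLs ending in .php, but not the same URL with a query string.
print(rule_matches("/*.php$", "/index.php"))
print(rule_matches("/*.php$", "/index.php?page=2"))
# "/*?*" covers any URL that contains a question mark.
print(rule_matches("/*?*", "/index.php?page=2"))
# A plain path is a prefix match.
print(rule_matches("/wp-admin/", "/wp-admin/options.php"))
```

Not every crawler supports these wildcards, which is one reason the extension rules sit under the Googlebot record rather than under `*`.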
Example 2
Robots.txt for phpBB
User-agent: *
Disallow: /cgi-bin/
Disallow: /phpbb/admin/
Disallow: /phpbb/cache/
Disallow: /phpbb/db/
Disallow: /phpbb/images/
Disallow: /phpbb/includes/
Disallow: /phpbb/language/
Disallow: /phpbb/templates/
Disallow: /phpbb/faq.php
Disallow: /phpbb/groupcp.php
Disallow: /phpbb/login.php
Disallow: /phpbb/memberlist.php
Disallow: /phpbb/modcp.php
Disallow: /phpbb/posting.php
Disallow: /phpbb/privmsg.php
Disallow: /phpbb/profile.php
Disallow: /phpbb/search.php
Disallow: /phpbb/viewonline.php
User-agent: Googlebot
# do not crawl files that have the following extensions
Disallow: /*.inc$
Disallow: /*.js$
Disallow: /*.css$
# do not crawl URLs that contain the mark= or view= parameters
Disallow: *mark=*
Disallow: *view=*
# allow the Google image bot to crawl all images
User-agent: Googlebot-Image
Allow: /*
# allow the AdSense bot across the whole site
User-agent: Mediapartners-Google*
Allow: /*
Example 3
User-agent: *
Disallow: /stats
Disallow: /dh_
Disallow: /V
Disallow: /z/j/
Disallow: /z/c/
Disallow: /cgi-bin/
Disallow: /viewtopic.php
Disallow: /viewforum.php
Disallow: /index.php?
Disallow: /posting.php
Disallow: /groupcp.php
Disallow: /search.php
Disallow: /login.php
Disallow: /post
Disallow: /member
Disallow: /profile.php
Disallow: /memberlist.php
Disallow: /faq.php
Disallow: /templates/
Disallow: /mx_
Disallow: /db/
Disallow: /admin/
Disallow: /cache/
Disallow: /images/
Disallow: /includes/
Disallow: /common.php
Disallow: /index.php
Disallow: /modcp.php
Disallow: /privmsg.php
Disallow: /viewonline.php
Disallow: /rss.php
User-agent: Googlebot
# do not allow crawling of files with these extensions (sitemap.php stays allowed)
Allow: /sitemap.php
Disallow: /*.php$
Disallow: /*.js$
Disallow: /*.inc$
Disallow: /*.css$
Disallow: /*.txt$
# do not allow crawling of URLs that contain a ?
Disallow: /*?*
Disallow: /*?
# do not allow crawling of files in the /wp-* directories
Disallow: /wp-*/
# do not allow the site to be archived (ia_archiver)
User-agent: ia_archiver
Disallow: /
# allow the Google image bot to crawl all images
User-agent: Googlebot-Image
Allow: /*.gif$
Allow: /*.png$
Allow: /*.jpeg$
Allow: /*.jpg$
Allow: /*.ico$
Allow: /images
Allow: /z/i/
# allow the AdSense bot across the whole site
User-agent: Mediapartners-Google*
Allow: /*
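The Googlebot record in the listing above pairs `Allow: /sitemap.php` with `Disallow: /*.php$`, so both rules match the same URL. Below is a minimal sketch of how such a conflict can be resolved, assuming the precedence Google documents for its crawlers: the rule with the longest path wins, and on a tie Allow wins. The helpers `rule_matches` and `is_allowed` and the trimmed rule list are hypothetical; other crawlers may resolve conflicts differently.

```python
import re

def rule_matches(rule_path, url_path):
    """Match a robots.txt pattern ('*' wildcard, optional trailing '$')."""
    anchored = rule_path.endswith("$")
    if anchored:
        rule_path = rule_path[:-1]
    body = ".*".join(re.escape(part) for part in rule_path.split("*"))
    return re.match(body + ("\\Z" if anchored else ""), url_path) is not None

def is_allowed(rules, url_path):
    """rules is a list of (directive, path) pairs from one User-agent record."""
    best_length = -1
    allowed = True  # no matching rule means the URL may be crawled
    for directive, path in rules:
        if not path or not rule_matches(path, url_path):
            continue
        is_allow = directive.lower() == "allow"
        # Longest rule path wins; on equal length, Allow beats Disallow.
        if len(path) > best_length or (len(path) == best_length and is_allow):
            best_length = len(path)
            allowed = is_allow
    return allowed

# A shortened copy of the Googlebot record, for illustration only.
GOOGLEBOT_RULES = [
    ("Allow", "/sitemap.php"),
    ("Disallow", "/*.php$"),
    ("Disallow", "/*.js$"),
    ("Disallow", "/*?*"),
]

for path in ["/sitemap.php", "/viewtopic.php", "/viewtopic.php?t=5", "/forum/"]:
    print(path, is_allowed(GOOGLEBOT_RULES, path))
```

Under these assumptions `/sitemap.php` stays crawlable because its Allow rule is longer than `/*.php$`, while every other .php file is blocked.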
- To create such a file, open Notepad and save the file as robots.txt.
- "User-agent: *" refers to all search engines.
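For those who prefer a script to Notepad, here is a minimal sketch that writes a robots.txt file and confirms that a `User-agent: *` record applies to any crawler name. The file content, the helper names `write_robots_txt` and `blocked_for`, and the bot name `SomeBot` are assumptions made for illustration; on a real site the file has to end up in the root directory of the web server.

```python
import tempfile
from pathlib import Path
from urllib.robotparser import RobotFileParser

# Minimal example content; the disallowed paths are placeholders.
ROBOTS_TXT = "User-agent: *\nDisallow: /cgi-bin/\nDisallow: /wp-admin/\n"

def write_robots_txt(directory, content=ROBOTS_TXT):
    """Save content as robots.txt inside directory and return the file path."""
    path = Path(directory) / "robots.txt"
    path.write_text(content, encoding="utf-8")
    return path

def blocked_for(content, user_agent, url_path):
    """Return True when the given crawler may not fetch url_path."""
    parser = RobotFileParser()
    parser.parse(content.splitlines())
    return not parser.can_fetch(user_agent, url_path)

# Write into a temporary directory here; use the site root in practice.
with tempfile.TemporaryDirectory() as site_root:
    saved = write_robots_txt(site_root)
    saved_text = saved.read_text(encoding="utf-8")

# The "*" record applies to a crawler we never named explicitly.
print(blocked_for(saved_text, "SomeBot", "/wp-admin/index.php"))
print(blocked_for(saved_text, "SomeBot", "/index.html"))
```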