Teradata Architecture


The biggest strength of the Teradata is the parallelism. So the architecture of the Teradata is designed in such way to keep this strength in mind. Teradata is unique from any other database because of its unique architecture only.

The main components of Teradata architecture is PE (Parsing Engine), AMP (Access Module Processor) and BYNET . We look into these components in details after looking the logical view of the architecture.

The logical view of Teradata architecture is given below –


Parsing Engine (PE) – Whenever a user login to Teradata it actually connect to Parsing Engine (PE). When a user submits a query, then the PE takes action, it creates a plan and instruct AMPs what to do in order to get the result from the query. The PE knows all, it knows how many AMPs are connected to Teradata system, how many rows are in the table and what is the best possible plan to execute the query. This is why the PE is also called as the ‘OPTIMIZER’.

Beside making a perfect plan for query execution PE also make a check on the access right of the user that weather the user has the privilege to execute the query or not. In this way PE also perform security feature on the users.


Access Module Processor (AMP) – Each AMP attached to the Teradata system listens to the PE via the BYNET for instructions. Each AMP is connected to its own disk and has the privilege to read or write the data to its disk. The AMP can be best considered as the computer processor with its own disk attached to it. Whenever it receives the instructions from the PE it fetches the data from its disk and sends it to back to PE through BYNET. Each AMP is allowed to read and write in its own disk ONLY. This is known as the ‘SHARED NOTHING ARCHITECTURE’. Teradata spreads the rows of the table evenly across all the AMPs, when PE asks for data all AMPs work simultaneously and read the records from its own DISK. Hence a query will be as slow as the slowest AMP in the system. This is known as parallelism.

BYNET – The BYNET is the communication channel between PE and AMP. It ensures that the communication between PE and AMP is correct and on right track. In Teradata system there are always two BYNET system. They are called as ‘BYNET 0’ and ‘BYNET 1’. But  we refer them as a single BYNET system. The reason two BYNET exist on a Teradata system is that –

1)      If one BYNET fails, the second BYNET takes over it place.

2)      Two BYNET improve the performance of the system, the PE and AMP can talk to each other over both BYNET which fasten the communication.


Short summary –

  • The PE checks the syntax of the query, check the user security rights
  • Then PE come up with the best optimized plan for the execution of the query
  • The PE passes this plan through BYNET to AMPs.
  • The AMPs follow the plan and retrieve the data from its DISK.
  • Then AMPs passes the data to PE through BYNET.
  • The PE then passes the data to the user.



I have also identified few E books which is immensely useful for Teradata Certification Preparation. Please have a look and if you are seriously preparing for TD certifications then don’t hesitate to purchase it.(Hover the mouse on the below image to get more information about the book)

 

 

97 comments

Skip to comment form

  1. Yo, just checked out 32wingame. Pretty slick site, man! Definitely giving it a look. Check it out yourself 32wingame.

  2. Sinagphcom? Not bad, not bad! The site design is clean and they got the usual stuff. Might be worth a spin if you’re bored. Hit them up at sinagphcom

  3. Found this keo88bet site. Looks interesting, but I’m always skeptical. Any thoughts? Any experiences to share? Here is the link: keo88bet.

    • kartheek on July 19, 2016 at 5:04 am
    • Reply

    hi all,

    Let me know what actually the bynet do? & who integrates the data from these amps&where the metadata stores about these amps(i.e what data that particular amp
    stores.
    thanks

      • Tom on December 15, 2016 at 4:04 pm
      • Reply

      Sorry, the Bynet is proprietary to Teradata. If you want most info on the Bynet, search their public documentation.

    • smriti on May 20, 2016 at 6:43 am
    • Reply

    Hi Admin

    Very nice and crystal clear description for beginners.

    Would be more helpful if i can get some idea on topics on teradata like fallback clusters,cliques, Hashing Algorythims and journals.

    1. please refer our free tutorial here –http://www.teradatatech.com/?page_id=587
      you’ll get most of the topics listed

    • lakshmi on May 2, 2016 at 4:20 pm
    • Reply

    hi

    currently i am working as testing engineer using informatica, oracle. i would like learn teradata for career growth. please suggest me what way to learn basics in teradata. is this relevant t to oracle sql commands or not.

    please suggest me.

    thanks
    sri

    • AkshayP on January 20, 2016 at 2:40 pm
    • Reply

    hi admin
    can you tell me that how many disk can be held by one amp?

      • NAGAMAIYA KARTHICK RAMALINGAM on March 30, 2016 at 5:55 pm
      • Reply

      Each AMP has associated with one disk…

    • Mnagaraja. M on August 30, 2015 at 12:05 pm
    • Reply

    what is differanc between teradata 12 and td 13

    • Nagaraja. M on August 30, 2015 at 12:00 pm
    • Reply

    what is differanc between teradata 12 and 13

    • loki on April 17, 2015 at 10:28 am
    • Reply

    CTRL+p and save the pages

    • Satish on July 22, 2014 at 12:28 pm
    • Reply

    What is the maximum limit of AMP connected to BYNET?

    1. Each BYNET can support maximum 512 nodes.
      In Each Node we can run maximum up to 16384 VPROCS (AMP, PE, Gateways etc).

        • hari arun on June 10, 2016 at 8:13 pm
        • Reply

        what is node here in this case?

    • Shruti on July 7, 2014 at 9:24 am
    • Reply

    Very nice explanation for each component. Should share some more infirmation about Teradata.

  4. Hi All,

    We have started a new special online teradata training batch for developers and DBA profile. For those who are interested to learn teradata can register in this batch. Fees is quite less when you compare it with other batches and biggest advantage is that its a instructor led training batch so you can ask any doubt in the training itself. Also we’ll be covering latest version of Teradata i.e. Teradata 14.10

    Please check following link for more details about this batch –
    http://www.onlineteradatatraining.com/?page_id=96

    Limited Seats. So try to register ASAP.

    • Ramnath Jk on April 9, 2014 at 5:10 am
    • Reply

    If each PE supports up to 120 sessions and each session can run 16 requests in parallel and each AMP can run 80 tasks in parallel, does this mean that we need minimum 24 AMPs ((120*16)/80)???

      • aditya on April 17, 2014 at 12:41 pm
      • Reply

      Hi Admin,

      Could you please reply to this question i am also intrested in knowing the answer of this

      1. No of AMPS used depends on the type of request submitted and type of index used in the query. A request can be a single AMP,group AMP or an all AMP one based on whethere we are using a PI/SI or no index.

        No of AWT’s(AMP worker tasks) used depends on how many steps into which PE/Optimizer disects our query into.

        At most 62 AWT’s can run in an AMP to do user related works, Rest 18 are reserved for internal work types.

        At a single point of time a session can have only one active query(it can either be a single statement or a multi statement query). This single query is divided into multiple steps and sent to AMPS by PE, AMPS distribute these steps among AWT’s to do a portion of the work.

        For understanding an AMP can be compared to a Process and an AWT can be compared to a Thread running under a Process.

    • shashank on March 27, 2014 at 6:56 am
    • Reply

    i need to learn basics of teradata as i am using this in mainframe… can please any body guide me which materials or url shd i use

      • Shivam on May 28, 2014 at 9:23 am
      • Reply

      Shashank,

      Same situation here, Even I use this with MF. Please feel free to contact me – sriv.shivam@outlook.com
      I am using TD v12 with MF, in retail Project

    • anilreddy on March 3, 2014 at 11:08 am
    • Reply

    this is best website ,good work guys.
    keep it .

    • Arpan on January 24, 2014 at 7:35 am
    • Reply

    Awesome Explanation……………………………. Superr Guru

    • Veena on December 10, 2013 at 1:16 pm
    • Reply

    Thank you Sir! Your blog is very helpful, and the users too are intuitive because of which we get to learn new dimensions of a particular topic.
    I have one question-
    How is the number of AMPs , and size of each AMP decided?

    1. Number of AMP and size is decided at the time of configuration of teradata system. Usually it depends on the requirement of project.

    • nilesh mali on December 8, 2013 at 6:27 pm
    • Reply

    Nice Explanation for teradata Architecture

    • Bharath on November 27, 2013 at 9:36 am
    • Reply

    very good explaination on TD for a novice.

    • Indrajeet Patil on November 18, 2013 at 9:39 am
    • Reply

    Very informative…… Great work i must say..!!

    • rajitha on October 21, 2013 at 1:13 pm
    • Reply

    Hi admin.
    I wants to do teradata certification. please guide me

  5. WAT A SITE I NEVER SEEN BEFORE BEST EXPLANATION KEEP IT

    • Jith on September 16, 2013 at 2:35 pm
    • Reply

    Good site

    • NEHA GUPTA on September 13, 2013 at 6:06 am
    • Reply

    hi

    • hima bindu on April 2, 2013 at 9:54 am
    • Reply

    good explanation…

    I hav a doubt…

    according to u…Execution will be slow as much as slowest amp but when one amp fails what happens to execution…
    will it stop the entire execution..

    1. when an AMP fails then teradata data protection features comes in action like (hot standby node, cliques, down amp recovery journal).

      in case these protections are not able to migrate the data of that failed AMP (highly impossible) then your execution will stop until you recover that AMP.

        • Sumit Nigam on April 25, 2013 at 7:56 am
        • Reply

        To further explain 2 points in case they are not clear:
        1. AMP writes to its “local” disk.
        2. In case, an AMP instance fails, a hot-standby can take over.

        Seeing both points together raises a question that if AMP writes locally then how does its local data get replicated to another instance. The way they do it is to actually have a common storage array between both AMP instances. They use iSCSI based SAN here, I think. So, both are sharing this “local” disk array. In case, the primary instance fails, the hot-standby can always be started without any “data-lag”. Of course, with iSCSI SANs also, storage tier can be a SPOF.

        1. It is correct that an AMP can perform I/O ops only on the VDISK/LUNS assigned to it.
          AMPS cannot read/write to VDISKS assigned to other AMPS.

          Coming to Hot Standby Nodes(HSN), They are protection mechanism for NODE failures, they can not protect against AMP failures.
          In case of a Node failure, AMPS get redistributed among other nodes or HSN(if present). For the migrated AMPS to perform operations on their VDISKS, Group of Teradata nodes(Cliques) are configured to access Disk Arrays of other nodes within the group like mentioned in previous comments. So, even if a node fails VPROCS migrated to other Nodes or HSN still can access the VDISKS in the Disk Array configured to the failed node.

          In case of AMP failures, The data of that particular AMP is not accessible which means full table scans and index operations involving failed AMP are not possible. Index accesses on healthy AMPS can still be performed.
          To protect against AMP failures, we have Fallback mechanism which replicates an AMPS data on another AMP. So, In case of an AMP failure, data can still be accessed from the fallback AMP.

      • Vasu on May 12, 2013 at 3:30 am
      • Reply

      Hi Himabindu,
      Can you please share your mail id :vvdrmca@gmaail.com

      • Nagaraja. M on August 30, 2015 at 12:03 pm
      • Reply

      what is differanc between teradata 12 and 13

    • Gourav on February 11, 2013 at 1:04 pm
    • Reply

    Hello Nitin,

    Thanks for putting up extra efforts to share your knowledge with us. keep it up.
    It is really good forum to learn TD concepts.

    Thanks,
    Gourav

    • vivekanand kola on January 31, 2013 at 2:03 pm
    • Reply

    Hi All,

    This is Vivekanand, this is very use full to who want learn teradata very quickly from basics to high level but it is very difficult to copy the data.Can any one give how to copy the data from this website.

    Thanks,
    Vivekanand kola

    1. Hi Vivekanand,

      Thanks for your comment.

      Actually copying content from this site is banned to maintain the originality of content.
      Hope you understand 🙂

        • Mukesh on February 27, 2013 at 8:38 pm
        • Reply

        Admin,

        We should not live in visual reallity . The basic concept for which Internet is created is sharing. I don’t understand when admin says that you can read from here but can’t copy from here .

        It’s just an CSS trick that disables right click which actually do not provide any security beacause any user who want data to copy will do ctrl + U and can copy easilly.

        request to admin this security is of no use and actually it annoys while typing becuase you can’t correct your mistakes by doing right click.

        Mukesh

        1. just disable javascript in ur browser and reload the page. now u shud be able to copy it.

  6. Hello all..i m new in teradata today is my 1st day in this organization..this site helps me alot to increase my knowledge level about this organization..i will make sure that i will be helpful for the company and will help the company by using my skills.. 😉

    • mujeeb on January 22, 2013 at 6:58 am
    • Reply

    Hey every one,i m new in teradata n this site helps me alot to increase my knowledge level about this organization..

    • savitha subbaraman on December 18, 2012 at 4:39 am
    • Reply

    Hi admin,
    this site is really helpfull! can i know about AWT? and in the concept of index why we go for secondary index when we already have primary index?

    1. secondary index is the alternate way to access data. To know more about SI please refer – http://www.teradatatech.com/?p=815

    • nitu on December 13, 2012 at 7:38 am
    • Reply

    Hi All,

    As I am new to TD and after reading the available input on this site really helped me to understand architecture of TD and how it actually works in real world in a very clear way to understand.

    Actually I am an SQL server resource but I am learning TD for informatica but found it more interesting so pls guide how to start with it in a more simpleway to understand it.

    1. if anyone is interested in teradata training for development profile please have a look – http://www.onlineteradatatraining.com/

    • Anju Vishnu on December 12, 2012 at 11:10 am
    • Reply

    I find this site very helpful. All the concepts are presented in a simple manner.

    I have a doubt – Out of Node, PE, BYNET,operating system,CPU and disks , which is a part of teradata open architecture?

      • priya on December 16, 2012 at 3:43 pm
      • Reply

      Hi Anju,

      This question i have faced in TD12 basics exam.. NODE is the part of teradata open architecture.

      1. thats correct 🙂

    • K.V.Koteswararao on December 6, 2012 at 3:20 pm
    • Reply

    Hi,

    First of all Many thanks for your useful post. I am new to teradata DB. Can you please post about SMP and MMP architectures.

    • Raju on September 27, 2012 at 1:52 pm
    • Reply

    Hi ,

    The way you explained the architeture is really great and appricate.

      • Ubendran on November 27, 2012 at 8:07 am
      • Reply

      Nice post for the beginners..

      1. Thanks 🙂

    • Pavan on June 15, 2012 at 4:06 pm
    • Reply

    Hello,

    Very nice post for the begineers..!!

    I write to get the right suggestion from you on my move to Teradata DBA role from Oracle DBA role, I have 6 yrs of experience as an oracle core DBA. I’m very much confused to take the decision. can you let me know the market and the oppurtunities in teradata for a DBA role. ‘ll the move benefit my career?

    Thanks
    Pavan

    1. Hi,

      Honestly speaking i don’t have much idea about ORACLE DBA market so can’t do the exact comparison between both these tech. But as far as TD DBA is concerned, there is huge requirements for DBA’s role, as you must be aware that TD is the emerging tech and most of the big shot clients are shifting towards huge data so its quite obvious that TD market is going strong day by day 🙂

      All the best for your career in TD

      Thanks

        • KRISHNA on July 23, 2013 at 7:01 am
        • Reply

        GOOD ONE

    • venu on May 14, 2012 at 7:30 am
    • Reply

    It’s very detailed and easily understandable for beginners

    • Saurabh on May 10, 2012 at 6:28 pm
    • Reply

    Thank You very much….
    I am very much waiting for more posts from you/

    • ankita on May 7, 2012 at 9:51 am
    • Reply

    good for a beginner

    1. thanks .. will very soon post on advanced Teradata architecture.

    • robert on April 18, 2012 at 11:35 pm
    • Reply

    How do I find out how many AMPS my system has?

      • rahul on April 26, 2012 at 11:19 pm
      • Reply

      we can find the Amps in our system by using hash functions like
      1)hash Amp
      2)hash bucket
      3)hash row.. etc

      the query to find number of amps in our system is
      SELECT HASH AMP( ) + 1;

      here is a question that y we add 1. bcoz it shows one less than the actual amps. the reason is hash amp required the hash bucket value..

      Regards
      Rahul

      1. very well answer Rahul 🙂

        just to add one small point …. the actual syntax of HASHAMP is –
        HASHAMP (< hashbucket >)

        so it will give you the AMP number of the particular hashbuket value.

        but if you wont give any hashbuket value inside the bracket, then it will give the number of AMPs (less than 1) configured in the Teradata system.

    • Pavab on April 3, 2012 at 6:21 pm
    • Reply

    very good explanation
    thanks

    • venkat on March 29, 2012 at 10:55 am
    • Reply

    how many amps will connect to bynet at at time?

    1. at the time all the AMPs in the teradata system are connected to BYNET.

    • Chaitanya Datti on March 14, 2012 at 10:05 am
    • Reply

    very very clear explanation !!!!!!!!

    1. Thanks Chaitanya

  7. Nice post.Thanks for sharing this

    1. Thanks Suhasini …

    • Sunny on February 26, 2012 at 6:51 pm
    • Reply

    Nice explanation, Admin. Appreciated !!

    1. thanks…. 🙂

    • prakash on February 15, 2012 at 3:49 pm
    • Reply

    good one for TD freshers….

    1. Thanks … 🙂

    • Dilip on February 6, 2012 at 7:58 am
    • Reply

    Hi,
    Am new to tera data.can u send me the docs of tera data to my mail id.my mail id is dilip.thokala@gmail.com.

    Thanks & Regards
    Dilip.T

    • Renuka on January 31, 2012 at 12:30 pm
    • Reply

    Am new to teradata..This is very useful for me..Can you send the document for teradata? how to install & how to execute? Send this pdf of doc file to this mail id:renukadevi.ciet@gmail.com . Thanks in advance.

    • PriyaSudheer on January 4, 2012 at 5:53 pm
    • Reply

    We nice and crystal clear explanation….!!

    Dear Admin.. Topics mentioned are clearly expplained and very easy to quick grasp..please explain some more topics that are newly added to TD12 syllubus like (applaences,processing types etc,)
    Expecting positive reponse

    Thanks
    PriyaSudheer

    1. will definitely try to post on TD 12 🙂

    • sudha on January 3, 2012 at 9:40 pm
    • Reply

    very clear explanation

    1. thanks .. 🙂

    • Raki on November 17, 2011 at 6:41 pm
    • Reply

    Really superb explanation!

    1. thanks 🙂

    • vanitha on November 16, 2011 at 7:37 am
    • Reply

    nice explanation

    • mw on October 31, 2011 at 6:14 pm
    • Reply

    Neat. Clean, and I understood it. Thanks!

    • prashant killada on September 6, 2011 at 6:42 pm
    • Reply

    hai!
    That was superb contribution regarding the architectural summary of teradata.
    Thanks and regards
    Prashant killada

    1. Thanks Prashant for the appreciation 🙂

    • Sadashivadas on August 1, 2011 at 11:20 pm
    • Reply

    hi

    1. hi

        • Ghouse on January 11, 2012 at 3:51 am
        • Reply

        Hi Admin,

        it is neat and clear explanation n understandable way.Thanks a lot 🙂

  8. hi Guys
    check for the new set of TD Basic Sample Questions available on my site –
    http://www.teradatatech.com/?p=325

      • Vasu on May 12, 2013 at 3:08 am
      • Reply

      Hi Admin,
      I have one good question on to increasae Max number of sessions on TD 13.00
      Here my Environment is TD database 13.00 and OS :Win -2003..
      And We have 8 AMP’S and 1 PE on TD 13.00

      Now my question is
      We have a requirement of checking 200 simultaneous connection to Teradata.
      Currently we have only able to make max 120 connections. (Because 1 PE =120 Sessions)
      Now we need to increase the max no of sessions for Teradata DB. ?
      Can you please tell em how to increase max no of sessions on TD ..
      Is it possible to increase max no of sessions ?

      1. Vasu,

        It is not possible to raise the standard 120 session limit for a PE.

        Your objective of being able to connect 200 sessions is still achievable if you can add up one more PE to the system. This can only be done with the help of Teradata PS support team as this involves downtime and re-configuration of the entire system.

Leave a Reply to nilesh mali Cancel reply

Your email address will not be published.

This site uses Akismet to reduce spam. Learn how your comment data is processed.