Researchers From HPC-AI Technology Inc. and the National University of Singapore Introduce ‘Colossal-AI’: A PyTorch-Based Deep Learning System For Large-Scale Parallel Training

Deep learning models are already revolutionizing the way we think about AI. One such type is the transformer model, which uses an attention mechanism to weight each part of the input data, giving more weight to the parts deemed most important. It is used primarily in natural language processing (NLP) and computer vision (CV) (1).
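
To make the weighting idea concrete, here is a minimal sketch of scaled dot-product attention for a single query, written in plain Python. It is only an illustration of the general mechanism: the function names and the toy vectors are assumptions made for this example and do not come from the post or from Colossal-AI.

```python
import math

def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend(query, keys, values):
    """Scaled dot-product attention for one query vector (illustrative helper)."""
    scale = math.sqrt(len(query))
    # Similarity of the query to each key, scaled by sqrt(dimension).
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    weights = softmax(scores)
    # Output is the weighted average of the value vectors.
    dim = len(values[0])
    output = [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]
    return weights, output

# Toy data (hypothetical): three input positions with 2-d keys and values.
keys = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
values = [[10.0, 0.0], [0.0, 10.0], [5.0, 5.0]]
weights, output = attend([1.0, 0.0], keys, values)
print(weights, output)
```

Positions whose keys line up with the query receive larger weights, so their values dominate the output.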

Larger models tend to perform better, but they run into the memory wall of current accelerator hardware such as GPUs. Training large models such as the Vision Transformer, BERT, and GPT on a single GPU or machine can be arduous, so AI researchers are constantly looking for ways to train their models in a distributed environment. Distributed training, however, often requires expertise in computer architecture and system design, which is hard to acquire without hands-on experience.

Researchers from HPC-AI Technology Inc. and the National University of Singapore (NUS) have introduced “Colossal-AI,” a PyTorch-based open-source system that makes distributed training in AI much more accessible for all.

Colossal-AI lets users combine data, pipeline, sequence, and several forms of tensor parallelism. A user can build a tensor-parallel distributed model in much the same way as a single-GPU model, because the researchers separated model building from how the model is distributed. The system supports several techniques, including 2D, 2.5D, and 3D tensor parallelism, sequence parallelism, and activation checkpointing.
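
As a rough illustration of what tensor parallelism means, the sketch below splits a linear layer's weight matrix column-wise into shards, computes each shard's partial result separately (as separate devices would), and concatenates the pieces. This is the simplest 1D scheme, not the 2D, 2.5D, or 3D schemes mentioned above, and it does not use Colossal-AI's API; every name here is hypothetical and the "devices" are simulated in one process.

```python
def matmul(a, b):
    """Plain matrix product of two lists-of-rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def split_columns(w, parts):
    """Split weight matrix w into `parts` equal column shards."""
    cols = len(w[0])
    assert cols % parts == 0, "column count must divide evenly across shards"
    step = cols // parts
    return [[row[i * step:(i + 1) * step] for row in w] for i in range(parts)]

def column_parallel_linear(x, w, parts):
    """Simulated 1D tensor parallelism: each shard handles a slice of the output."""
    shards = split_columns(w, parts)
    # In a real system each partial product would run on its own device.
    partials = [matmul(x, shard) for shard in shards]
    # Stand-in for an all-gather: stitch the column slices back together.
    return [sum((p[r] for p in partials), []) for r in range(len(x))]

# Toy input and weights (hypothetical values).
x = [[1, 2], [3, 4]]
w = [[1, 0, 2, 1], [0, 1, 1, 3]]
print(column_parallel_linear(x, w, parts=2) == matmul(x, w))
```

The sharded computation gives the same result as the single-device product, which is why the model definition can stay unchanged while the distribution strategy varies.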

Paper | GitHub | Quick 3 Min Read

👍 5
👤 u/techsucker
📅 Nov 01 2021
Data science training for official statistics: A new scientific paradigm of information and knowledge development in national statistical systems content.iospress.com/down…
👍 2
👤 u/QueeLinx
📅 Oct 26 2021
Stryker Mobile Gun System M1128 (MGS). Sand bag camouflage (US National Training Center, Fort Irwin CA) 2015.
👍 34
👤 u/Alxmac2012
📅 Dec 27 2020
Today on the filmstrip soundtrack cassette preservation marathon: National parks, youth killers, consumer education, and a Chevrolet dealer training system from (1979). youtu.be/PWOA-9VzIao
👍 36
👤 u/uncommonephemera
📅 Sep 14 2020
Source of this slide was from a webinar by the American Hospital Association by Dr James Lawer on 26 February. NETEC is the national Ebola training center, which was established after the 2014 Ebola crisis to help healthcare systems prepare for Ebola and other novel, emerging threats. imgur.com/AebP0L8
👍 17
👤 u/whereshegoes
📅 Mar 07 2020
Canada Child Benefit still needed alongside national daycare system, minister says nationalpost.com/news/can…
👍 183
👤 u/Lotushope
📅 Dec 29 2021
Acevedo training with the national team
👍 316
👤 u/retrolaw3
📅 Dec 02 2021
Under-15 Men’s National Team Kicks Off 2022 With 36-player Training Camp In Chula Vista, Calif. ussoccer.com/stories/2022…
👍 54
👤 u/ThingEnthusiastt
📅 Jan 07 2022
Are there any restrictor gates out there that look like the Arc System Works training mode gate? It seems like square, octo, and circle are the only gates that exist.
👍 55
👤 u/ClashmanTheDupe
📅 Dec 15 2021
While training on a new inmate ID system in 1998, I decided to make my educational attempt a keepsake for my grandkids. Hopefully they remember that I had a sense of humor.
👍 111
👤 u/rebelshirts
📅 Nov 30 2021
FC Bayern Munich player tests positive for COVID-19. The national team's training has been cancelled for today. spiegel.de/sport/fussball…
👍 374
👤 u/outrageously_smart
📅 Nov 09 2021
Martin B-26G Marauder 43-34581 at the National Museum of the USAF in Dayton, OH. Though painted as a B-26B that flew with the 9th Air Force's 387th Bomb Group, this example served with the Free French Air Forces and was acquired from Air France's mechanic's training school in 1965. reddit.com/gallery/r9qmon
👍 274
👤 u/MrPlaneGuy
📅 Dec 05 2021
TA3 Hellish Training Arena - Ryuunsai system youtu.be/fdksW-gUlJc
👍 50
👤 u/BorderColliex
📅 Dec 06 2021
The Steamtown National Historic Site is a national park and museum in downtown Scranton. It is a complex of buildings built between 1899 and 1932 featuring a functioning roundhouse and turntable, and tours on the last remaining line of Scranton/Wilkes-Barre's once massive trolley & El train system.
👍 307
👤 u/NockNooty138
📅 Dec 27 2020
A US Air Force F-16C Fighting Falcon aircraft assigned to the 160th Fighter Squadron, Alabama Air National Guard, releases a GBU-24A/B 2,000-pound laser guided bomb over the Utah Test and Training Range during exercise Combat Hammer, an air-to-ground Weapons Systems Evaluation Program, 30/07/2002
👍 13
👤 u/PUTINKAAA
📅 Dec 12 2019
Xiaomi Mobile will pre-install the National Anti-Fraud Center (国家反诈中心) app on its new phones with the latest system (MIUI13) reddit.com/gallery/rryqh9
👍 97
👤 u/Koigggyear
📅 Dec 30 2021
Thirty-three sniper teams participate in the 51st Winston P. Wilson and 31st Armed Forces Skill at Arms Meeting Sniper Rifle Matches hosted by the National Guard Marksmanship Training Center at the Fort Chaffee Joint Maneuver Training Center, Barling, Arkansas, Dec. 4-9. reddit.com/gallery/r9zp1z
👍 111
👤 u/ForcesProject
📅 Dec 06 2021
[WTS] SuperSpeed Men's Golf Training System

SOLD

Excellent, like-new condition. Barely used.

Asking $150 shipped (US, lower 48 states)

https://superspeedgolf.com/collections/all-products/products/mens-set-2-0

https://preview.redd.it/edvvqmt4yy681.jpg?width=768&format=pjpg&auto=webp&s=96cd7ebb4899f87b6d8b6173a47ee9ce71b759da

https://preview.redd.it/v2n04rf5yy681.jpg?width=1024&format=pjpg&auto=webp&s=cbab4953a4520fffe538a81e01917138c0f3dbd0

👍 6
👤 u/babaganoush012377
📅 Dec 21 2021
A member of the 1st Battalion, 108th Armor, 48th Brigade, Georgia National Guard, rests on the turret of his Multiple Integrated Laser Engagement System (MILES) equipped M-60A3 main battle tank during a training exercise, 15/07/1983
👍 3
👤 u/PUTINKAAA
📅 Dec 23 2019
Norwegian national flag flown as the ensign on the ocean research/training ship Statsraad Lehmkuhl. reddit.com/gallery/ryq7kw
👍 61
👤 u/lukeETERNITY
📅 Jan 08 2022
NRA is proposing a National School Shields Safety Program for all schools free of charge provided by the NRA. The plan is to have armed security in every school, security systems, and training for teachers and students. cbs12.com/news/top-storie…
👍 116
👤 u/forceduse
📅 Dec 21 2012
Researchers from China have developed an economical method for creating GPT-3-style Natural Language Processing systems while avoiding the increasingly prohibitive expense of time and money involved in training up high volume datasets unite.ai/creating-a-gpt-s…
👍 134
👤 u/Dr_Singularity
📅 Nov 09 2021
National Missing and Unidentified Persons System is providing free training on reservations

By: Ruth Milka - November 12, 2019

Read the article here: https://www.nationofchange.org/2019/11/12/national-missing-and-unidentified-persons-system-is-providing-free-training-on-reservations/

In an effort to combat the epidemic of missing and murdered indigenous women and girls, specialists are leading free training sessions on how to use the National Missing and Unidentified Persons System (NamUs) at reservations throughout the country.

This past week Assistant U.S. Attorney and member of Montana’s Missing Indigenous Persons Task Force Jared Cobell led a training session at the Blackfeet Community College. “Everybody can access this system. It’s not just for law enforcement,” Cobell said. “All of us can input things into NamUs. If you’re missing a relative or friend, you can create a profile for them, where you can add details and photos of them. And then others can search for that person, too.”

NamUs is a free national resource center for missing, unidentified, and unclaimed person cases in the United States. It includes free and secure technology, forensic services (such as fingerprint and DNA analyses), and investigative support from staff. Although people are able to enter their own public information about a missing person case on the public side of NamUs, the information must be vetted and verified before it is made accessible to the public.

In the United States, indigenous men and especially women go missing at disproportionate rates. In 2016 alone there were 5,712 reports of missing American Indian and Alaska Native women and girls. In Montana, where Native Americans make up only 6.6% of the population, indigenous people account for 25% of the reported missing cases.

What’s more, cases of missing indigenous people often go unreported. When they are reported, jurisdictional obstacles and poor resources often get in the way of any real progress, leaving families to search for their loved ones on their own. People such as Cobell hope that training people to use NamUs will lead more of them to use the system to find answers.

NamUs will continue throughout the month to host free training sessions at the Rocky Boy’s, Fort Belknap, and Fort Peck reservations.

👍 4
👤 u/NationofChange
📅 Nov 13 2019
I was replaced with the training system I created, demoted, offered less pay, and they had the audacity to be shocked when I quit after being bullied over the scheduling for two months.

I've been considering sharing this story for a while now. It's not something I got to discuss much after it happened, but I think this community will appreciate it.

I worked for Chick-fil-A for about five years. It's been a hot minute, so I don't feel bad name dropping them here. I wasn't especially religious, but they were willing to hire me when I was freshly moved into the state, and they offered good pay. And I was ambitious at the time. I moved up quickly in the first couple of years, and eventually became the Training Director. That meant that I knew how to do every position in the store (front of house AND back of house), as well as how to effectively train for every position. I was in charge of on-boarding the new hires, setting up their training schedules, and making sure they were ready to be scheduled in position after two weeks. And I'll go ahead and toot my own horn - I was damn good at what I did. A few months after the promotion, our retention rate leapt, as did our sales, drive thru numbers, and ratings online. I won't attribute all of it to my training; we hired on some fantastic people in that time. But I will say that my training team helped those people learn quickly and feel confident in what they were doing.

After being in the position for a while, I developed a new training system to help my team and the shift leads train more effectively. We had training iPads, and I used an app to develop a system that allowed for more streamlined tracking of each new hire's training, online and in person. My team picked it up pretty quickly, and our numbers continued to rise. More team members were asking to become trainers and work under me, and the turnover was the best it had been since I'd started working there.

About a year into having this position, there was a small retreat for all the directors. On this retreat, our store operator announced that the director positions were going to be salaried, as well as receive some fantastic benefits. I was ECSTATIC. As a college student, this was huge. My pay would effectively jump by about $10/hr. I'd be able to replace my crappy car finally. I could pay for school out of pocket instead of loans. Everything was going to get so much easier. Plus - no more overtime in order to make ends meet! I could focus on school more.

A week after the retreat, my direct boss (one position below our operator) pulled me aside. In a very short, blunt conversation, she informed me that to cut costs for the store, she

...

👍 27
👤 u/arthur-morgan2
📅 Dec 10 2021
Eric Berger: Jeanette Epps "remains assigned to NASA’s Boeing Starliner-1 ... (but) is cross training with the team on the Crew Dragon system." twitter.com/SciGuySpace/s…
👍 246
👤 u/CProphet
📅 Oct 07 2021
Could training books also use a toggle system similar to loot scrolls?

Would be a nice QOL change since there's nothing worse than popping your books but realizing you want to do something else.

👍 31
👤 u/NewMetaMessiah
📅 Dec 04 2021
How will national squads adapt their training for LA 2028 given the racing there will be over 1500m?

Given the fact that at the LA 2028 Olympics, the rowing regatta will be over a 1500m course (source, somewhat ironically a row2k article) instead of the standard 2000m, how do you predict national squads will change their preparation for this, if at all? How different is the approach to a 1500m to a 2000m physiologically? Do you think squads might, for example, do 1500m erg tests for that Olympiad instead of the normal 2k? I'd assume that the events leading up to the Olympics like the World Cups would still be over 2K, so how do you think squads would approach those? Interested to hear any thoughts.

👍 72
👤 u/mdmeaux
📅 Nov 21 2021
What are good embedded system training courses with certificates that you could recommend? I am leaning more towards embedded Linux since it is what my current job involves.

I recently received my Christmas bonus and I want to invest it to get some certifications and training. I would like to ask for your recommendations for which ones are worth it. It would be best for me if it is something leaning towards embedded Linux. It would also be nice if there are projects within the course; I have set aside money to buy boards and sensors to build such projects. Thank you!

👍 37
📅 Dec 11 2021
National Missing and Unidentified Persons System is providing free training on reservations reddit.com/user/NationofC…
👍 2
👤 u/NationofChange
📅 Nov 13 2019
[Jared Isaacman] We have been tracking it from beginning..Design & testing in Hawthorne..to the systems & training procedures..to the flight-ready hardware that shipped to KSC. A few weeks in clean room we saw fully assembled module w/ cupola installed on Dragon. @SpaceX is an incredible company. twitter.com/rookisaacman/…
👍 1k
👤 u/tonybinky20
📅 Aug 17 2021
Shall we discuss the new training system?

Friends. We've noticed your great interest in the new soldier training system that we've added to replace the outdated Academy, and we want to compare it with the old mechanics using a clear example.

The main and most important thing. The idea behind the new system is simplicity. We wanted to get rid of all the unnecessary randomness of getting new soldiers, hundreds of different soldiers with different specialties in reserve, with a long and confusing path to upgrade.

And we definitely succeeded! Now you will immediately see the price that you need to pay to train your soldier. Some of you are intimidated by these high numbers, but let's calculate together whether it's more expensive than the old one.

https://preview.redd.it/50u756e5hht71.png?width=704&format=png&auto=webp&s=36ccd4c6ac6caf41fa517afcc97732a726ef535d

In the new system, you need a lot, but MUCH fewer soldiers than in the old system. After all, now you won't lose 2 out of 3 soldiers sent to the Academy for training. You are training the soldier that you want to be upgraded. But how much did it actually cost in the old system?

In the old system, training one soldier of any rank from level IV to level V required 54 first-level soldiers in the Academy. This is a lot and takes a lot of time, even if you don't take into account the large randomness of obtaining soldiers from bronze orders (one out of 12 classes and 3 tiers).

In the new system, you see a clear price for the training of a particular soldier. For example:

Upgrading a soldier Tier I from level IV to V costs 18 bronze orders.

Upgrading a soldier Tier II from level IV to V costs 54 bronze orders.

Upgrading a soldier Tier III from level IV to V costs 72 bronze orders.

And the main thing. Taking into account the fact that, in this update we DOUBLED the speed of receiving Bronze orders, your costs accordingly also drop in half.

---

The last question that worries you sounds like this: "Where can beginners get soldiers if they don't get them from Bronze orders?"

Receive them for Silver orders. Since in the new system you don’t need to donate soldiers for training, there is no need to call in hundreds of soldiers. For an average squad, 3-4 soldiers will be enough to fully staff it, and according to our calculations, this won't be difficult.

👍 48
👤 u/Marc1p4n
📅 Oct 14 2021
The ship is sinking! Pedos can't hide forever - DOJ Says CNN Producer Raped 9-Year-Old While Her Mother Watched, Solicited Girls Age 9-16 For Sex, 'Virtual Training' Sessions - National File
👍 17
👤 u/thanosied
📅 Dec 13 2021
Sentry Bot at the National Guard Training Yard (RANT)

The Sentry Bot at the National Guard Training Yard is making me lose my mind. You can't do anything about it to make it better.

I placed mines in front of its door so that it would be damaged enough to kill and guess what? It teleports past them, magically.

I cheesed it and KILLED IT using all of my explosives before entering the armory... and GUESS WHO WAS WAITING OUTSIDE FOR ME?

YEAH. WHAT?

I'm literally running away in full power armor popping jets, med-X, and stim packs just to get away. Infuriating. Anyway, end rant. Thank you.

👍 117
👤 u/MclaurinPCBuilds
📅 Oct 22 2021
Just got done being told "you have a lot of nerve keeping a wild animal as a pet" Sakar is from a nationally renowned breeder, national grand champion dame, two grand champion siblings in breed and all breed. She has handler, companion and agility training. Otherwise, yeah she's wild! hilarious.
👍 123
👤 u/huskysizeguy99
📅 Nov 14 2021
The Venezuelan National Guard has officially confirmed the training of soldiers of its rapid response unit by instructors of the Russian PMCs "V.E.G.A", formerly known as Vega Strategic Services (VSS). Venezuela
👍 55
👤 u/Nihilist911
📅 Dec 29 2021
Reservation System To Be Tested At Arches National Park Next Year nationalparkstraveler.org…
👍 122
👤 u/Synthdawg_2
📅 Dec 11 2021
Mikey Rukus: “Every pro wrestler on this planet deserves to have their very own entrance theme. The kid coming out of training school is just as important as the world champion who’s on national television. If you are a producer and don’t believe this, you’re starting off on the wrong foot.” twitter.com/mikeyrukus/st…
👍 1k
👤 u/Weezy-NJPW_Fan
📅 Sep 10 2021
Thoughts on the National RV Training Academy (NRVTA)?

Hello all,

I came across this place and wondered if anyone has experiences/opinions with it they're willing to share. So far, I've found overwhelmingly positive reviews and testimonials and am just trying to see if it's for real. I'm definitely interested in going, as I'll be a full-timer soon and feel, at the least, it would boost my aptitude for all the problems that I'll come across. I also like the idea of making an LLC and doing mobile inspections/repairs.

At any rate, hope this is the right place to post. Thanks!

👍 6
📅 Dec 08 2021
I’m a controls engineer who never received any formal training with the systems I use: Allen Bradley, ABB, Emerson. Is there any value in taking the formal training classes provided by these companies?

For instance, my favorite system is the Allen Bradley line of PLCs. I have no problem reading function block or ladder logic but couldn’t set up a system from scratch. I’ve only had to add items like alarms and logic to open/close valves, simple stuff. I can troubleshoot, but there is still a lot of knowledge I am missing. I know there are cheaper options, but has anyone taken the courses offered by Allen Bradley? I may be able to get my company to pay for a course here and there. My title is controls engineer and I deal with instrumentation, actuators, motor controls, and electrical projects. PLCs are a good portion of my job, but I don’t do any design-from-scratch work, though I would like to eventually. We bid projects out, but there are some things I believe I could do to save the company some money once I develop some more skills, like my predecessor who retired. Right now I mostly develop project scope and troubleshoot random issues that might come up.

👍 12
👤 u/Mikecool51
📅 Nov 11 2021
[Twitter] Thiago Almada in training with Lionel Messi and the Argentine national team twitter.com/mls_buzz/stat…
👍 125
👤 u/MLS_Buzz
📅 Nov 18 2021
UK National Crime Agency (police) officers drag a 'suspect' out of his car during a training exercise [1908x1146]
👍 224
👤 u/gopniksquatting
📅 Nov 07 2021