Find Jobs
Hire Freelancers

C++ Cuda kernel for transpose of 2D arrays

$30-250 USD

Completed
Posted over 3 years ago

$30-250 USD

Paid on delivery
A c++ Cuda kernel for a transpose function is to be programmed. The kernel should be state of the art, which means maybe more than just plain copy (e.g. if useful, with coalescent memory access for both read and writing). The main goal is speed!!! It will be used with an RTX 2080 card. The input data consists of a 2D-array (an image) with float numbers. Mainly, the size of X and Y are not equal, not a multiple of 256 and varying. An example of how to use the kernel needs to be given, e.g. load an image with the Nvidia-SDK, transpose it and save it.
Project ID: 26894759

About the project

3 proposals
Remote project
Active 4 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
Awarded to:
User Avatar
Hi, I’m Cuda programmer. I’m familiar with this job. Transpose cuda 2d use shared memory with arbitrary size.
$167 USD in 1 day
5.0 (3 reviews)
3.4
3.4
3 freelancers are bidding on average $248 USD for this job
User Avatar
Hi, I’ve expertise in image processing and Cuda kernels. Have been using Cuda extensively for parallel processing on large datasets.
$278 USD in 3 days
5.0 (1 review)
3.2
3.2

About the client

Flag of GERMANY
Jena, Germany
5.0
19
Payment method verified
Member since Aug 3, 2018

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.