need for speed!

I have a feeling this will be a dificult question.my problem is that my pixel setting routine is too slow:

PSET is the original pixel setting routine, but of course, we all know how slow it is.then there's GET & PUT, quite fast, but no way to manipulate the sprites well for scrolling, special effects, etc.

then there's the POKEing method, where you POKE a color value into the VRAM (SEG = A000), at an offset that represents the pixel coordinates(POKE 320& * Y% + X%)this is very fast, (at least two times faster than PSET.)and even faster in compilation.and is what I have been using so far.but... it seems it's STILL too slow, is there a faster way of plotting pixels?should I do something different than plotting for drawing large scrolling backgrounds?I want to do this without libraries. but ASM in the code is ok with me.I've been trying to figure out the assembly equivelent to this code.it seems easy, but I don't have TASM or NASM, so DEBUG:A is what I got.and MOV [A000]:AX, BX doesn't work in debug, (what some one suggested to me).I'm sorry for the long post, but if anyone can help me, then I would appreciate it greatly

I downloaded this program some years ago.The memcopy func is almost as mine above.Check it out:[code]' Bmpload Version 2 By Doug Barry (PD Computers)' This is the fastest BMP Loader I have ever seen, and I wrote it !!!!' God it's fast, anyway it uses assembler to copy a variable to the another' part of the memory, here I have used it to put the image data into the' bit of the Physical ram that overlaps the video memory, hence loading' the picture in one vertical blank space (one 50Hz cycle)'' Thanks load to Dan Holmes for finding out the memory copying routine, and' Andrew Griffin/Jon Sutton for info on the BMP structure.' Also to the guy/gal that posted ShowBMP9.bas on the net and to the' guy/gal that wrote CPLASMA.BAS for the "OUT" command for palette setting.

' Enjoy, even though it's uncommented it should be easy to understand,' being only 65, yes count 'em 65 lines of code (WOW!!!!!!!!).

hi , have you tried to use assembly language ?it is very fast . contact me to have the dma-straight-to-screen-kill-ya-momma-with-an-axepixel routine (it is 48 microprocessor clock cycles) hoping you know how to include assembly in qbasic.