Problem Set 4 Harvard Extension School CSCI E-93: Computer Architecture Fall 2024 Due: October 27, 2024 at Midnight ET (Eastern Time) Total of 270 Points Please submit your solution to this problem set using "git" with named branch problem-set-4. For this assignment, based on your architecture design from Problem Set 2 with appropriate modifications made by you based on comments and design changes, implement all of the following programs. 1. (100 Points) Write an assembler for your assembly language that will translate programs (all of the instructions and any data) in symbolic form into hexadecimal 16-bit words by location in memory to be emitted as a file in the Altera Memory Initialization File (.mif) format (see the link at: https://cscie93.dce.harvard.edu/fall2024/def_mif.htm). Note that for the DE2-115, the value for the DEPTH parameter must be 32768, the WIDTH parameter must be 16. These constraints are necessary because the memory system we are providing allows only the low 64K bytes of memory to be initialized in the DE2-115. You are free to design your assembly language as you wish. Attributes of traditional assembly languages are described in the document named "Assembler Concepts (a.k.a. Some Assembly Is Required)" on the class web site which is accessible at https://cscie93.dce.harvard.edu/fall2024/slides/Assembler%20Concepts.txt. Your assembler should allow forward references to be used in the symbolic program. You may choose the appropriate number of passes to be made over the symbolic assembler program in order to produce your hexadecimal output file. One pass may require fixups, two passes are sufficient, three or more passes may allow optimization to be performed if short and long branches are allowed in your instruction set. You may write the assembler in C, C++, or Java. If you'd like to use a different implementation language, please check with the instructor. Extra Credit: You can receive extra credit for implementing one or more enhancements to your assembler. These include: (1) Allow many different kinds of integral constants in your assembly programs. For example, this might include decimal, hexadecimal, octal, and ASCII character constants. (2) In addition to producing the .mif output, have your assembler optionally produce a listing file that would include the original assembly language input along with the address where that line's code/data will be stored and the hexadecimal representation of the machine code generated for that line. So, a line in a listing file might look like: 0x00000024: 0x35280011 loop: ori $t0, $t1, 17 In this example, 0x00000024 is the address in memory where the instruction will be placed and 0x35280011 is the machine code that was generated from the assembly language statement. (3) Allow simple expression evaluation at assembly time. This would be a feature that allows an assembly language programmer to write arithmetic or logical expressions that can be evaluated to a constant at assembly time. For example, it might allow MIPS instructions to be written as follows: loop: ori $t0, $t1, 34+42 lui $t2, loop>>16 (4) If you have experience in expression evaluation (from taking a compiler design class, for example), allow more advanced expression evaluation at assembly time. This might include dealing with operator precedence and more operators such as, parentheses as follows: addi $t3, (127+1)/2 (5) Allow labels to also be used as symbolic constants equal to an integral value. This feature allows a programmer to use a symbolic name for a constant integral value. A usual way to implement this feature is as follows: bufferSize: .equ 128 Then, the label "bufferSize" may be used wherever an integral constant would be legal in your assembly language as follows: main: ori $t4, $0, bufferSize .space bufferSize (6) Allow some number of useful assembler pseudo-instructions. After writing all of the following subroutines, ensure that your assembler is able to successfully translate the complete program (all of the instructions and any data) into a MIF file (that is, hexadecimal bytes by location in memory). You should assume that the computer starts execution at location zero in memory. Each solution should be implemented as a subroutine (function, procedure). You should deal with the I/O devices using the memory-mapped interface defined in the I/O specification document: https://cscie93.dce.harvard.edu/fall2024/io_interface.txt. If your architecture allows both byte and 16-bit or 32-bit word accesses to memory, keep in mind that the memory system we are providing is organized as little endian. 2. (10 Points) Write a subroutine in your assembly language to output a single character on the output device. The character may be stored in a register or in memory as appropriate for your architecture. 3. (10 Points) Write a subroutine in your assembly language to input a single character from the input device. The character will be returned in a register or in memory as appropriate for your architecture. 4. (15 Points) Write a subroutine in your assembly language to output a sequence of characters (i.e., a string) stored in memory on the output device. This subroutine should call the subroutine in part 2 above in order to output each character. You should determine how you will be representing strings in your architecture. For example, the string could be either null-terminated or its length could be specified. 5. (15 Points) Write a subroutine in your assembly language to read an enter-terminated (i.e., line-feed) sequence of characters from the input device. This subroutine should call the subroutine in part 3 above in order to input each character. The string should be stored in memory after being read. Use the same representation for a string that you defined in part 4 above. 6. (25 Points) Write a subroutine in your assembly language to output a signed decimal integer on the output device. Prior to outputting the value of the number, it will be stored as a signed 2-byte (16-bit) integer or as a signed 4-byte (32-bit) integer in two's-complement representation. The integer may be stored in a register or in memory as appropriate for your architecture. This subroutine should either call the subroutine in part 2 or part 4 above in order to output the signed decimal integer. The minimum requirement is to be able to be able to output a signed decimal integer between -32768 and 32767, inclusive. If you are implementing an architecture that supports integers of size greater than 2 bytes (16 bits), then you must be able to display the entire range of values represented by your integers. Extra Credit: You are welcome to implement extended precision in software (for example, even on a 16-bit word size computer you could allow output of either a 32-bit doubleword integer or a 64-bit quadword integer), but that is not required. 7. (25 Points) Write a subroutine in your assembly language to input a a line containing a signed decimal integer from the input device. This subroutine should call the subroutine in part 5 above in order to input the line. The value of the number should be converted into a signed 2-byte (16-bit) integer or into a signed 4-byte (32-bit) integer in two's complement representation. The integer will be returned in a register or in memory as appropriate for your architecture. The minimum requirement is to be able to be able to input a signed decimal integer between -32768 and 32767, inclusive. If you are implementing an architecture that supports integers of size greater than 2 bytes (16 bits), then you must be able to input the entire range of values represented by your integers. No range checking is required on the input integers; therefore, if an input integer is not between -32768 and 32767, inclusive, then the correct value does not need to read as an integer and no error message needs to be displayed. Extra Credit: You are welcome to implement better input error checking and you are welcome to implement extended precision in software (for example, even on a 16-bit word size computer you could allow input of either a 32-bit doubleword integer or a 64-bit quadword integer), but that is not required. 8. (50 Points) Write a subroutine in your assembly language to multiply two signed integers (the multiplicand multiplied by the multiplier) and return the resulting signed integer product. Even if your hardware will include a hardware multiplier as a special feature, you should not use your multiply instruction in your solution to this problem. Your solution must be more efficient than just repeated addition and should implement a shift-and-add approach that is traditionally taught for "long multiplication." The minimum requirement is for the multiplicand, the multiplier, and the product to all be 16-bit signed integer words. If the product is not between -32768 and 32767, inclusive, then the product does not need to be correctly computed and no error message needs to be displayed. Extra Credit: You are welcome to implement better error checking and you are welcome to implement extended precision (for example, to multiply two 16-bit words to produce a 32-bit doubleword product or to multiply two 32-bit doublewords to produce either a 32-bit doubleword or a 64-bit quadword product), but that is not required. 9. (20 Points) Write the main program in your assembly language to prompt (i.e., output a string to request input) a user to enter two signed decimal integers and to multiply one by the other and then to output the resulting integer product in decimal. This main program should call the subroutines written above to perform all of the above operations. The integers must be able to represent values from -32768 to 32767, but may represent a larger range of integer values. After outputting the result, the main program should repeat that process forever -- requesting another set of two integers to be multiplied and so forth. Last revised 15-Oct-24